Overview


Dataset statistics

Number of variables: 68
Number of observations: 26789
Missing cells: 484275
Missing cells (%): 26.6%
Total size in memory: 13.9 MiB
Average record size in memory: 544.0 B
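
The derived figures in the dataset statistics follow from the raw counts by simple arithmetic. The sketch below recomputes the missing-cell percentage and the record size. The 8 bytes per cell is an assumption (one object pointer per cell on a 64-bit build, string payloads excluded); the report itself does not state how memory was measured.

```python
# Recompute derived dataset statistics from the counts reported above.
n_variables = 68
n_observations = 26789
missing_cells = 484275

total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: each cell of a text (object) column costs one 8-byte pointer,
# so a record of 68 columns occupies 68 * 8 bytes, excluding string payloads.
bytes_per_cell = 8
record_bytes = n_variables * bytes_per_cell
total_mib = record_bytes * n_observations / 2**20

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"record size: {record_bytes} B")
print(f"total size: {total_mib:.1f} MiB")
```

Under that assumption the results agree with the reported 26.6%, 544.0 B, and 13.9 MiB.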

Variable types

Text: 68

Dataset

Description: CUMV Amphibian and Reptile Collection 0000571-250121130708018
URL: https://doi.org/10.15468/emivh3

Alerts

accessRights has constant value "http://vertnet.org/resources/norms.html" Constant
language has constant value "en" Constant
license has constant value "https://creativecommons.org/publicdomain/zero/1.0/" Constant
type has constant value "PhysicalObject" Constant
institutionID has constant value "http://grbio.org/cool/i64g-wjcr" Constant
collectionID has constant value "http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw" Constant
institutionCode has constant value "CUMV" Constant
collectionCode has constant value "Herps" Constant
basisOfRecord has constant value "PreservedSpecimen" Constant
georeferenceVerificationStatus has constant value "requires verification" Constant
occurrenceStatus has constant value "present" Constant
identificationQualifier has constant value "sp." Constant
kingdom has constant value "Animalia" Constant
nomenclaturalCode has constant value "ICZN" Constant
dynamicProperties has 26491 (98.9%) missing values Missing
recordNumber has 16238 (60.6%) missing values Missing
sex has 26685 (99.6%) missing values Missing
lifeStage has 22252 (83.1%) missing values Missing
georeferenceVerificationStatus has 10270 (38.3%) missing values Missing
occurrenceRemarks has 17106 (63.9%) missing values Missing
eventDate has 1766 (6.6%) missing values Missing
startDayOfYear has 4508 (16.8%) missing values Missing
endDayOfYear has 4508 (16.8%) missing values Missing
year has 1766 (6.6%) missing values Missing
month has 2737 (10.2%) missing values Missing
day has 4508 (16.8%) missing values Missing
verbatimEventDate has 322 (1.2%) missing values Missing
habitat has 26694 (99.6%) missing values Missing
eventRemarks has 24414 (91.1%) missing values Missing
continent has 1793 (6.7%) missing values Missing
waterBody has 26775 (99.9%) missing values Missing
islandGroup has 25019 (93.4%) missing values Missing
island has 26322 (98.3%) missing values Missing
country has 1917 (7.2%) missing values Missing
countryCode has 1917 (7.2%) missing values Missing
stateProvince has 4766 (17.8%) missing values Missing
county has 9234 (34.5%) missing values Missing
municipality has 26785 (> 99.9%) missing values Missing
minimumElevationInMeters has 24884 (92.9%) missing values Missing
maximumElevationInMeters has 24884 (92.9%) missing values Missing
decimalLatitude has 10270 (38.3%) missing values Missing
decimalLongitude has 10270 (38.3%) missing values Missing
geodeticDatum has 10270 (38.3%) missing values Missing
coordinateUncertaintyInMeters has 15142 (56.5%) missing values Missing
georeferencedBy has 10456 (39.0%) missing values Missing
georeferencedDate has 13907 (51.9%) missing values Missing
identificationQualifier has 26788 (> 99.9%) missing values Missing
specificEpithet has 945 (3.5%) missing values Missing
infraspecificEpithet has 20731 (77.4%) missing values Missing
gbifID has unique values Unique
bibliographicCitation has unique values Unique
references has unique values Unique
occurrenceID has unique values Unique
catalogNumber has unique values Unique
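
The three alert kinds listed above (Constant, Missing, Unique) can be derived from per-column counts. The following is a minimal sketch, not ydata-profiling's implementation: the function name `column_alerts` and the `missing_threshold` parameter are hypothetical, and `None` stands in for a missing cell. It counts distinct values over non-missing cells only, which is consistent with georeferenceVerificationStatus and identificationQualifier being flagged both Constant and Missing.

```python
from collections import Counter

def column_alerts(values, missing_threshold=0.0):
    """Return alert labels for one column; None marks a missing cell.

    Hypothetical helper: the threshold default is an assumption, not
    a setting taken from the report.
    """
    n = len(values)
    present = [v for v in values if v is not None]
    n_missing = n - len(present)
    counts = Counter(present)

    alerts = []
    # Constant: exactly one distinct non-missing value.
    if len(counts) == 1:
        alerts.append("Constant")
    # Missing: share of missing cells exceeds the threshold.
    if n and n_missing / n > missing_threshold:
        alerts.append("Missing")
    # Unique: no missing cells and every value occurs exactly once.
    if present and n_missing == 0 and len(counts) == n:
        alerts.append("Unique")
    return alerts

print(column_alerts(["en", "en", "en", "en"]))
print(column_alerts(["sp.", None, None, None]))
print(column_alerts(["1852462112", "1852462111", "1852462110"]))
```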

Reproduction

Analysis started: 2025-01-23 23:14:22.337256
Analysis finished: 2025-01-23 23:14:24.398641
Duration: 2.06 seconds
Software version: ydata-profiling v4.12.1
Download configuration: config.json

Variables

gbifID
Text

Unique 

Distinct: 26789
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Characters and Unicode

Total characters: 267890
Distinct characters: 10
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26789
Unique (%): 100.0%

Sample

1st row: 1852462112
2nd row: 1852462111
3rd row: 1852462110
4th row: 1852462109
5th row: 1852462108
ValueCountFrequency (%)
1852462112 1
 
< 0.1%
1852462084 1
 
< 0.1%
1852462108 1
 
< 0.1%
1852462107 1
 
< 0.1%
1852462106 1
 
< 0.1%
1852462105 1
 
< 0.1%
1852462104 1
 
< 0.1%
1852462103 1
 
< 0.1%
1852462102 1
 
< 0.1%
1852462101 1
 
< 0.1%
Other values (26779) 26779
> 99.9%

Most occurring characters

ValueCountFrequency (%)
5 47619
17.8%
4 46845
17.5%
8 37757
14.1%
1 37672
14.1%
2 36909
13.8%
3 14820
 
5.5%
6 13076
 
4.9%
9 11068
 
4.1%
0 11065
 
4.1%
7 11059
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 267890
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
5 47619
17.8%
4 46845
17.5%
8 37757
14.1%
1 37672
14.1%
2 36909
13.8%
3 14820
 
5.5%
6 13076
 
4.9%
9 11068
 
4.1%
0 11065
 
4.1%
7 11059
 
4.1%

Most occurring scripts

ValueCountFrequency (%)
Common 267890
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
5 47619
17.8%
4 46845
17.5%
8 37757
14.1%
1 37672
14.1%
2 36909
13.8%
3 14820
 
5.5%
6 13076
 
4.9%
9 11068
 
4.1%
0 11065
 
4.1%
7 11059
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 267890
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
5 47619
17.8%
4 46845
17.5%
8 37757
14.1%
1 37672
14.1%
2 36909
13.8%
3 14820
 
5.5%
6 13076
 
4.9%
9 11068
 
4.1%
0 11065
 
4.1%
7 11059
 
4.1%

accessRights
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 39
Median length: 39
Mean length: 39
Min length: 39

Characters and Unicode

Total characters: 1044771
Distinct characters: 17
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: http://vertnet.org/resources/norms.html
2nd row: http://vertnet.org/resources/norms.html
3rd row: http://vertnet.org/resources/norms.html
4th row: http://vertnet.org/resources/norms.html
5th row: http://vertnet.org/resources/norms.html
ValueCountFrequency (%)
http://vertnet.org/resources/norms.html 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
r 133945
12.8%
t 133945
12.8%
/ 107156
10.3%
e 107156
10.3%
s 80367
7.7%
o 80367
7.7%
h 53578
 
5.1%
. 53578
 
5.1%
m 53578
 
5.1%
n 53578
 
5.1%
Other values (7) 187523
17.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 857248
82.1%
Other Punctuation 187523
 
17.9%
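
The category counts above can be reproduced with Python's standard `unicodedata` module, which returns the Unicode general category of each character (`Ll` is Lowercase Letter, `Po` is Other Punctuation). This is a sketch of the idea rather than the profiler's own code; because accessRights is constant, the per-value counts are simply multiplied by the number of observations.

```python
import unicodedata
from collections import Counter

# The single value held by every row of accessRights.
value = "http://vertnet.org/resources/norms.html"
n_rows = 26789

# Count characters of the value by Unicode general category.
per_value = Counter(unicodedata.category(ch) for ch in value)

# Scale to the whole column, since every row holds the same string.
totals = {cat: n * n_rows for cat, n in per_value.items()}
print(totals)
```

The totals are 857248 for `Ll` and 187523 for `Po`, matching the table.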

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 133945
15.6%
t 133945
15.6%
e 107156
12.5%
s 80367
9.4%
o 80367
9.4%
h 53578
 
6.2%
m 53578
 
6.2%
n 53578
 
6.2%
v 26789
 
3.1%
g 26789
 
3.1%
Other values (4) 107156
12.5%
Other Punctuation
ValueCountFrequency (%)
/ 107156
57.1%
. 53578
28.6%
: 26789
 
14.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 857248
82.1%
Common 187523
 
17.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 133945
15.6%
t 133945
15.6%
e 107156
12.5%
s 80367
9.4%
o 80367
9.4%
h 53578
 
6.2%
m 53578
 
6.2%
n 53578
 
6.2%
v 26789
 
3.1%
g 26789
 
3.1%
Other values (4) 107156
12.5%
Common
ValueCountFrequency (%)
/ 107156
57.1%
. 53578
28.6%
: 26789
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1044771
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 133945
12.8%
t 133945
12.8%
/ 107156
10.3%
e 107156
10.3%
s 80367
7.7%
o 80367
7.7%
h 53578
 
5.1%
. 53578
 
5.1%
m 53578
 
5.1%
n 53578
 
5.1%
Other values (7) 187523
17.9%

bibliographicCitation
Text

Unique 

Distinct: 26789
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 37
Median length: 37
Mean length: 37
Min length: 37

Characters and Unicode

Total characters: 991193
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26789
Unique (%): 100.0%

Sample

1st row: 91f5620f-6148-4707-938a-b2a4ae2c7418.
2nd row: 30380134-d8e4-4799-ae86-3774458e3994.
3rd row: 06a12423-1e1e-483b-8637-27615d54247e.
4th row: ecd7fc3c-0293-49a2-a315-1c603f586ebe.
5th row: 6581f514-9ead-4ab5-a0ad-06734e5bc1a1.
ValueCountFrequency (%)
91f5620f-6148-4707-938a-b2a4ae2c7418 1
 
< 0.1%
9931b7ee-f706-4fca-98bb-eefa52153a20 1
 
< 0.1%
6581f514-9ead-4ab5-a0ad-06734e5bc1a1 1
 
< 0.1%
bf599922-53e6-4d0c-b519-f63121432409 1
 
< 0.1%
d2dec992-d370-4428-8635-f1b407f69f7b 1
 
< 0.1%
6236d7d7-a86e-42fa-8c22-742d6aa13c68 1
 
< 0.1%
51972baf-d3a6-490c-b761-f6c8f28c8783 1
 
< 0.1%
b8df5068-de46-482f-8b48-d367f6a4ec88 1
 
< 0.1%
fc9cd2c9-3860-4de1-aaf1-cc7cdafb2000 1
 
< 0.1%
78b0897b-5e79-4184-8d17-51a9c287ac22 1
 
< 0.1%
Other values (26779) 26779
> 99.9%

Most occurring characters

ValueCountFrequency (%)
4 169486
17.1%
6 115605
11.7%
- 107156
10.8%
1 90783
9.2%
a 90188
9.1%
3 63815
 
6.4%
9 62723
 
6.3%
e 39634
 
4.0%
2 37504
 
3.8%
8 37204
 
3.8%
Other values (8) 177095
17.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 659315
66.5%
Lowercase Letter 197933
 
20.0%
Dash Punctuation 107156
 
10.8%
Other Punctuation 26789
 
2.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 169486
25.7%
6 115605
17.5%
1 90783
13.8%
3 63815
 
9.7%
9 62723
 
9.5%
2 37504
 
5.7%
8 37204
 
5.6%
5 36458
 
5.5%
7 36089
 
5.5%
0 9648
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
a 90188
45.6%
e 39634
20.0%
d 31875
 
16.1%
c 15081
 
7.6%
f 11357
 
5.7%
b 9798
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 107156
100.0%
Other Punctuation
ValueCountFrequency (%)
. 26789
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 793260
80.0%
Latin 197933
 
20.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 169486
21.4%
6 115605
14.6%
- 107156
13.5%
1 90783
11.4%
3 63815
 
8.0%
9 62723
 
7.9%
2 37504
 
4.7%
8 37204
 
4.7%
5 36458
 
4.6%
7 36089
 
4.5%
Other values (2) 36437
 
4.6%
Latin
ValueCountFrequency (%)
a 90188
45.6%
e 39634
20.0%
d 31875
 
16.1%
c 15081
 
7.6%
f 11357
 
5.7%
b 9798
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 991193
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 169486
17.1%
6 115605
11.7%
- 107156
10.8%
1 90783
9.2%
a 90188
9.1%
3 63815
 
6.4%
9 62723
 
6.3%
e 39634
 
4.0%
2 37504
 
3.8%
8 37204
 
3.8%
Other values (8) 177095
17.9%

language
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 53578
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: en
2nd row: en
3rd row: en
4th row: en
5th row: en
ValueCountFrequency (%)
en 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
e 26789
50.0%
n 26789
50.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 53578
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 26789
50.0%
n 26789
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 53578
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 26789
50.0%
n 26789
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 53578
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 26789
50.0%
n 26789
50.0%

license
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 50
Median length: 50
Mean length: 50
Min length: 50

Characters and Unicode

Total characters: 1339450
Distinct characters: 24
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: https://creativecommons.org/publicdomain/zero/1.0/
2nd row: https://creativecommons.org/publicdomain/zero/1.0/
3rd row: https://creativecommons.org/publicdomain/zero/1.0/
4th row: https://creativecommons.org/publicdomain/zero/1.0/
5th row: https://creativecommons.org/publicdomain/zero/1.0/
ValueCountFrequency (%)
https://creativecommons.org/publicdomain/zero/1.0 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
/ 160734
 
12.0%
o 133945
 
10.0%
i 80367
 
6.0%
m 80367
 
6.0%
c 80367
 
6.0%
r 80367
 
6.0%
e 80367
 
6.0%
t 80367
 
6.0%
. 53578
 
4.0%
n 53578
 
4.0%
Other values (14) 455413
34.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1044771
78.0%
Other Punctuation 241101
 
18.0%
Decimal Number 53578
 
4.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 133945
12.8%
i 80367
 
7.7%
m 80367
 
7.7%
c 80367
 
7.7%
r 80367
 
7.7%
e 80367
 
7.7%
t 80367
 
7.7%
n 53578
 
5.1%
a 53578
 
5.1%
s 53578
 
5.1%
Other values (9) 267890
25.6%
Other Punctuation
ValueCountFrequency (%)
/ 160734
66.7%
. 53578
 
22.2%
: 26789
 
11.1%
Decimal Number
ValueCountFrequency (%)
1 26789
50.0%
0 26789
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1044771
78.0%
Common 294679
 
22.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 133945
12.8%
i 80367
 
7.7%
m 80367
 
7.7%
c 80367
 
7.7%
r 80367
 
7.7%
e 80367
 
7.7%
t 80367
 
7.7%
n 53578
 
5.1%
a 53578
 
5.1%
s 53578
 
5.1%
Other values (9) 267890
25.6%
Common
ValueCountFrequency (%)
/ 160734
54.5%
. 53578
 
18.2%
1 26789
 
9.1%
: 26789
 
9.1%
0 26789
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1339450
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 160734
 
12.0%
o 133945
 
10.0%
i 80367
 
6.0%
m 80367
 
6.0%
c 80367
 
6.0%
r 80367
 
6.0%
e 80367
 
6.0%
t 80367
 
6.0%
. 53578
 
4.0%
n 53578
 
4.0%
Other values (14) 455413
34.0%
Distinct: 1345
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 21
Median length: 10
Mean length: 10.55515323
Min length: 10

Characters and Unicode

Total characters: 282762
Distinct characters: 14
Distinct categories: 4
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1339
Unique (%): 5.0%

Sample

1st row: 2017-04-06 09:34:35.0
2nd row: 2017-02-07 12:21:10.0
3rd row: 2017-02-07 12:24:35.0
4th row: 2017-04-06 11:32:53.0
5th row: 2017-04-06 10:38:59.0
ValueCountFrequency (%)
2018-08-04 25437
90.4%
2017-08-04 235
 
0.8%
2017-08-07 178
 
0.6%
2017-08-03 74
 
0.3%
2017-07-26 70
 
0.2%
2017-06-22 53
 
0.2%
2017-06-21 52
 
0.2%
2017-06-05 51
 
0.2%
2017-06-28 44
 
0.2%
2017-04-06 44
 
0.2%
Other values (1361) 1903
 
6.8%

Most occurring characters

ValueCountFrequency (%)
0 82379
29.1%
- 53578
18.9%
8 51739
18.3%
1 29378
 
10.4%
2 28311
 
10.0%
4 26775
 
9.5%
: 2704
 
1.0%
7 1849
 
0.7%
(space) 1352
 
0.5%
. 1352
 
0.5%
Other values (4) 3345
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 223776
79.1%
Dash Punctuation 53578
 
18.9%
Other Punctuation 4056
 
1.4%
Space Separator 1352
 
0.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 82379
36.8%
8 51739
23.1%
1 29378
 
13.1%
2 28311
 
12.7%
4 26775
 
12.0%
7 1849
 
0.8%
3 1047
 
0.5%
5 958
 
0.4%
6 744
 
0.3%
9 596
 
0.3%
Other Punctuation
ValueCountFrequency (%)
: 2704
66.7%
. 1352
33.3%
Dash Punctuation
ValueCountFrequency (%)
- 53578
100.0%
Space Separator
ValueCountFrequency (%)
(space) 1352
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 282762
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 82379
29.1%
- 53578
18.9%
8 51739
18.3%
1 29378
 
10.4%
2 28311
 
10.0%
4 26775
 
9.5%
: 2704
 
1.0%
7 1849
 
0.7%
(space) 1352
 
0.5%
. 1352
 
0.5%
Other values (4) 3345
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 282762
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 82379
29.1%
- 53578
18.9%
8 51739
18.3%
1 29378
 
10.4%
2 28311
 
10.0%
4 26775
 
9.5%
: 2704
 
1.0%
7 1849
 
0.7%
(space) 1352
 
0.5%
. 1352
 
0.5%
Other values (4) 3345
 
1.2%

references
Text

Unique 

Distinct: 26789
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 78
Median length: 78
Mean length: 78
Min length: 78

Characters and Unicode

Total characters: 2089542
Distinct characters: 35
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 26789
Unique (%): 100.0%

Sample

1st row: http://portal.vertnet.org/o/cumv/herps?id=91f5620f-6148-4707-938a-b2a4ae2c7418
2nd row: http://portal.vertnet.org/o/cumv/herps?id=30380134-d8e4-4799-ae86-3774458e3994
3rd row: http://portal.vertnet.org/o/cumv/herps?id=06a12423-1e1e-483b-8637-27615d54247e
4th row: http://portal.vertnet.org/o/cumv/herps?id=ecd7fc3c-0293-49a2-a315-1c603f586ebe
5th row: http://portal.vertnet.org/o/cumv/herps?id=6581f514-9ead-4ab5-a0ad-06734e5bc1a1
ValueCountFrequency (%)
http://portal.vertnet.org/o/cumv/herps?id=91f5620f-6148-4707-938a-b2a4ae2c7418 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=9931b7ee-f706-4fca-98bb-eefa52153a20 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=6581f514-9ead-4ab5-a0ad-06734e5bc1a1 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=bf599922-53e6-4d0c-b519-f63121432409 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=d2dec992-d370-4428-8635-f1b407f69f7b 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=6236d7d7-a86e-42fa-8c22-742d6aa13c68 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=51972baf-d3a6-490c-b761-f6c8f28c8783 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=b8df5068-de46-482f-8b48-d367f6a4ec88 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=fc9cd2c9-3860-4de1-aaf1-cc7cdafb2000 1
 
< 0.1%
http://portal.vertnet.org/o/cumv/herps?id=78b0897b-5e79-4184-8d17-51a9c287ac22 1
 
< 0.1%
Other values (26779) 26779
> 99.9%

Most occurring characters

ValueCountFrequency (%)
4 169486
 
8.1%
t 133945
 
6.4%
/ 133945
 
6.4%
e 120001
 
5.7%
a 116977
 
5.6%
6 115605
 
5.5%
- 107156
 
5.1%
r 107156
 
5.1%
1 90783
 
4.3%
o 80367
 
3.8%
Other values (25) 914121
43.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1055181
50.5%
Decimal Number 659315
31.6%
Other Punctuation 241101
 
11.5%
Dash Punctuation 107156
 
5.1%
Math Symbol 26789
 
1.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 133945
12.7%
e 120001
11.4%
a 116977
11.1%
r 107156
10.2%
o 80367
 
7.6%
p 80367
 
7.6%
d 58664
 
5.6%
v 53578
 
5.1%
h 53578
 
5.1%
c 41870
 
4.0%
Other values (9) 208678
19.8%
Decimal Number
ValueCountFrequency (%)
4 169486
25.7%
6 115605
17.5%
1 90783
13.8%
3 63815
 
9.7%
9 62723
 
9.5%
2 37504
 
5.7%
8 37204
 
5.6%
5 36458
 
5.5%
7 36089
 
5.5%
0 9648
 
1.5%
Other Punctuation
ValueCountFrequency (%)
/ 133945
55.6%
. 53578
 
22.2%
? 26789
 
11.1%
: 26789
 
11.1%
Dash Punctuation
ValueCountFrequency (%)
- 107156
100.0%
Math Symbol
ValueCountFrequency (%)
= 26789
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1055181
50.5%
Common 1034361
49.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 133945
12.7%
e 120001
11.4%
a 116977
11.1%
r 107156
10.2%
o 80367
 
7.6%
p 80367
 
7.6%
d 58664
 
5.6%
v 53578
 
5.1%
h 53578
 
5.1%
c 41870
 
4.0%
Other values (9) 208678
19.8%
Common
ValueCountFrequency (%)
4 169486
16.4%
/ 133945
12.9%
6 115605
11.2%
- 107156
10.4%
1 90783
8.8%
3 63815
 
6.2%
9 62723
 
6.1%
. 53578
 
5.2%
2 37504
 
3.6%
8 37204
 
3.6%
Other values (6) 162562
15.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2089542
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 169486
 
8.1%
t 133945
 
6.4%
/ 133945
 
6.4%
e 120001
 
5.7%
a 116977
 
5.6%
6 115605
 
5.5%
- 107156
 
5.1%
r 107156
 
5.1%
1 90783
 
4.3%
o 80367
 
3.8%
Other values (25) 914121
43.7%

type
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Characters and Unicode

Total characters: 375046
Distinct characters: 13
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: PhysicalObject
2nd row: PhysicalObject
3rd row: PhysicalObject
4th row: PhysicalObject
5th row: PhysicalObject
ValueCountFrequency (%)
physicalobject 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
c 53578
14.3%
P 26789
 
7.1%
h 26789
 
7.1%
y 26789
 
7.1%
s 26789
 
7.1%
i 26789
 
7.1%
a 26789
 
7.1%
l 26789
 
7.1%
O 26789
 
7.1%
b 26789
 
7.1%
Other values (3) 80367
21.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 321468
85.7%
Uppercase Letter 53578
 
14.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
c 53578
16.7%
h 26789
8.3%
y 26789
8.3%
s 26789
8.3%
i 26789
8.3%
a 26789
8.3%
l 26789
8.3%
b 26789
8.3%
j 26789
8.3%
e 26789
8.3%
Uppercase Letter
ValueCountFrequency (%)
P 26789
50.0%
O 26789
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 375046
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
c 53578
14.3%
P 26789
 
7.1%
h 26789
 
7.1%
y 26789
 
7.1%
s 26789
 
7.1%
i 26789
 
7.1%
a 26789
 
7.1%
l 26789
 
7.1%
O 26789
 
7.1%
b 26789
 
7.1%
Other values (3) 80367
21.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 375046
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
c 53578
14.3%
P 26789
 
7.1%
h 26789
 
7.1%
y 26789
 
7.1%
s 26789
 
7.1%
i 26789
 
7.1%
a 26789
 
7.1%
l 26789
 
7.1%
O 26789
 
7.1%
b 26789
 
7.1%
Other values (3) 80367
21.4%

institutionID
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 31
Median length: 31
Mean length: 31
Min length: 31

Characters and Unicode

Total characters: 830459
Distinct characters: 18
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: http://grbio.org/cool/i64g-wjcr
2nd row: http://grbio.org/cool/i64g-wjcr
3rd row: http://grbio.org/cool/i64g-wjcr
4th row: http://grbio.org/cool/i64g-wjcr
5th row: http://grbio.org/cool/i64g-wjcr
ValueCountFrequency (%)
http://grbio.org/cool/i64g-wjcr 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
o 107156
12.9%
/ 107156
12.9%
g 80367
 
9.7%
r 80367
 
9.7%
i 53578
 
6.5%
t 53578
 
6.5%
c 53578
 
6.5%
l 26789
 
3.2%
w 26789
 
3.2%
- 26789
 
3.2%
Other values (8) 214312
25.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 589358
71.0%
Other Punctuation 160734
 
19.4%
Decimal Number 53578
 
6.5%
Dash Punctuation 26789
 
3.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 107156
18.2%
g 80367
13.6%
r 80367
13.6%
i 53578
9.1%
t 53578
9.1%
c 53578
9.1%
l 26789
 
4.5%
w 26789
 
4.5%
h 26789
 
4.5%
b 26789
 
4.5%
Other values (2) 53578
9.1%
Other Punctuation
ValueCountFrequency (%)
/ 107156
66.7%
. 26789
 
16.7%
: 26789
 
16.7%
Decimal Number
ValueCountFrequency (%)
4 26789
50.0%
6 26789
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 26789
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 589358
71.0%
Common 241101
29.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 107156
18.2%
g 80367
13.6%
r 80367
13.6%
i 53578
9.1%
t 53578
9.1%
c 53578
9.1%
l 26789
 
4.5%
w 26789
 
4.5%
h 26789
 
4.5%
b 26789
 
4.5%
Other values (2) 53578
9.1%
Common
ValueCountFrequency (%)
/ 107156
44.4%
- 26789
 
11.1%
4 26789
 
11.1%
6 26789
 
11.1%
. 26789
 
11.1%
: 26789
 
11.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 830459
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 107156
12.9%
/ 107156
12.9%
g 80367
 
9.7%
r 80367
 
9.7%
i 53578
 
6.5%
t 53578
 
6.5%
c 53578
 
6.5%
l 26789
 
3.2%
w 26789
 
3.2%
- 26789
 
3.2%
Other values (8) 214312
25.8%

collectionID
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 65
Median length: 65
Mean length: 65
Min length: 65

Characters and Unicode

Total characters: 1741285
Distinct characters: 26
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw
2nd row: http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw
3rd row: http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw
4th row: http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw
5th row: http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw
ValueCountFrequency (%)
http://grbio.org/cool/fra1-wrdm 26789
33.3%
(empty) 26789
33.3%
http://grbio.org/cool/vg5r-57jw 26789
33.3%
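
The value table above lists word tokens rather than whole cell values: the single collectionID string yields three entries, one of them blank. That output is consistent with lowercasing each value, splitting on whitespace, and stripping punctuation from both ends of every token, which would turn the lone `|` into an empty string. The sketch below illustrates that reading; it is inferred from the report's output and is not confirmed to be how ydata-profiling computes the table. The function name `word_counts` is hypothetical.

```python
import string
from collections import Counter

def word_counts(values):
    """Count whitespace-separated tokens, lowercased, with leading and
    trailing punctuation removed (an assumed tokenisation rule)."""
    strip_chars = string.punctuation + string.whitespace
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            counts[token.strip(strip_chars)] += 1
    return counts

collection_id = "http://grbio.org/cool/fra1-wrdm | http://grbio.org/cool/vg5r-57jw"
for token, n in word_counts([collection_id]).items():
    print(repr(token), n)
```

The same rule would also account for `PhysicalObject` appearing as `physicalobject` and for the license URL losing its trailing slash in the corresponding tables.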

Most occurring characters

ValueCountFrequency (%)
/ 214312
 
12.3%
o 214312
 
12.3%
r 187523
 
10.8%
g 133945
 
7.7%
t 107156
 
6.2%
h 53578
 
3.1%
c 53578
 
3.1%
w 53578
 
3.1%
- 53578
 
3.1%
5 53578
 
3.1%
Other values (16) 616147
35.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1178716
67.7%
Other Punctuation 321468
 
18.5%
Decimal Number 107156
 
6.2%
Dash Punctuation 53578
 
3.1%
Space Separator 53578
 
3.1%
Math Symbol 26789
 
1.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 214312
18.2%
r 187523
15.9%
g 133945
11.4%
t 107156
9.1%
h 53578
 
4.5%
c 53578
 
4.5%
w 53578
 
4.5%
l 53578
 
4.5%
i 53578
 
4.5%
b 53578
 
4.5%
Other values (7) 214312
18.2%
Other Punctuation
ValueCountFrequency (%)
/ 214312
66.7%
. 53578
 
16.7%
: 53578
 
16.7%
Decimal Number
ValueCountFrequency (%)
5 53578
50.0%
7 26789
25.0%
1 26789
25.0%
Dash Punctuation
ValueCountFrequency (%)
- 53578
100.0%
Space Separator
ValueCountFrequency (%)
(space) 53578
100.0%
Math Symbol
ValueCountFrequency (%)
| 26789
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1178716
67.7%
Common 562569
32.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 214312
18.2%
r 187523
15.9%
g 133945
11.4%
t 107156
9.1%
h 53578
 
4.5%
c 53578
 
4.5%
w 53578
 
4.5%
l 53578
 
4.5%
i 53578
 
4.5%
b 53578
 
4.5%
Other values (7) 214312
18.2%
Common
ValueCountFrequency (%)
/ 214312
38.1%
- 53578
 
9.5%
5 53578
 
9.5%
. 53578
 
9.5%
(space) 53578
 
9.5%
: 53578
 
9.5%
7 26789
 
4.8%
| 26789
 
4.8%
1 26789
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1741285
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 214312
 
12.3%
o 214312
 
12.3%
r 187523
 
10.8%
g 133945
 
7.7%
t 107156
 
6.2%
h 53578
 
3.1%
c 53578
 
3.1%
w 53578
 
3.1%
- 53578
 
3.1%
5 53578
 
3.1%
Other values (16) 616147
35.4%

institutionCode
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 107156
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: CUMV
2nd row: CUMV
3rd row: CUMV
4th row: CUMV
5th row: CUMV
ValueCountFrequency (%)
cumv 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
C 26789
25.0%
U 26789
25.0%
M 26789
25.0%
V 26789
25.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 107156
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 26789
25.0%
U 26789
25.0%
M 26789
25.0%
V 26789
25.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 107156
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 26789
25.0%
U 26789
25.0%
M 26789
25.0%
V 26789
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 107156
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
C 26789
25.0%
U 26789
25.0%
M 26789
25.0%
V 26789
25.0%

collectionCode
Text

Constant 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 209.4 KiB

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Characters and Unicode

Total characters: 133945
Distinct characters: 5
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Herps
2nd row: Herps
3rd row: Herps
4th row: Herps
5th row: Herps
ValueCountFrequency (%)
herps 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
H 26789
20.0%
e 26789
20.0%
r 26789
20.0%
p 26789
20.0%
s 26789
20.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 107156
80.0%
Uppercase Letter 26789
 
20.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 26789
25.0%
r 26789
25.0%
p 26789
25.0%
s 26789
25.0%
Uppercase Letter
ValueCountFrequency (%)
H 26789
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 133945
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
H 26789
20.0%
e 26789
20.0%
r 26789
20.0%
p 26789
20.0%
s 26789
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 133945
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
H 26789
20.0%
e 26789
20.0%
r 26789
20.0%
p 26789
20.0%
s 26789
20.0%

basisOfRecord
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size209.4 KiB
2025-01-23T18:14:27.087373image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length17
Mean length17
Min length17

Characters and Unicode

Total characters455413
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPreservedSpecimen
2nd rowPreservedSpecimen
3rd rowPreservedSpecimen
4th rowPreservedSpecimen
5th rowPreservedSpecimen
ValueCountFrequency (%)
preservedspecimen 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
e 133945
29.4%
r 53578
 
11.8%
P 26789
 
5.9%
s 26789
 
5.9%
v 26789
 
5.9%
d 26789
 
5.9%
S 26789
 
5.9%
p 26789
 
5.9%
c 26789
 
5.9%
i 26789
 
5.9%
Other values (2) 53578
 
11.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 401835
88.2%
Uppercase Letter 53578
 
11.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 133945
33.3%
r 53578
 
13.3%
s 26789
 
6.7%
v 26789
 
6.7%
d 26789
 
6.7%
p 26789
 
6.7%
c 26789
 
6.7%
i 26789
 
6.7%
m 26789
 
6.7%
n 26789
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
P 26789
50.0%
S 26789
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 455413
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 133945
29.4%
r 53578
 
11.8%
P 26789
 
5.9%
s 26789
 
5.9%
v 26789
 
5.9%
d 26789
 
5.9%
S 26789
 
5.9%
p 26789
 
5.9%
c 26789
 
5.9%
i 26789
 
5.9%
Other values (2) 53578
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 455413
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 133945
29.4%
r 53578
 
11.8%
P 26789
 
5.9%
s 26789
 
5.9%
v 26789
 
5.9%
d 26789
 
5.9%
S 26789
 
5.9%
p 26789
 
5.9%
c 26789
 
5.9%
i 26789
 
5.9%
Other values (2) 53578
 
11.8%

dynamicProperties
Text

Missing 

Distinct275
Distinct (%)92.3%
Missing26491
Missing (%)98.9%
Memory size209.4 KiB

Length

Max length72
Median length71
Mean length45.4966443
Min length31

Characters and Unicode

Total characters13558
Distinct characters35
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique254 ?
Unique (%)85.2%

Sample

1st row{"snout vent length in mm":"92.7","weight":"80.4","weight units":"g" }
2nd row{"snout vent length in mm":"95.3","weight":"87.4","weight units":"g" }
3rd row{"snout vent length in mm":"85.1","weight":"72.3","weight units":"g" }
4th row{"snout vent length in mm":"110","weight":"121","weight units":"g" }
5th row{"snout vent length in mm":"109","weight":"124.6","weight units":"g" }
ValueCountFrequency (%)
298
15.9%
in 294
15.7%
snout 293
15.6%
length 293
15.6%
vent 293
15.6%
units":"g 101
 
5.4%
mm 4
 
0.2%
mm":"109.5 3
 
0.2%
mm":"65 3
 
0.2%
mm":"92.8 2
 
0.1%
Other values (275) 294
15.7%

Most occurring characters

ValueCountFrequency (%)
" 1924
14.2%
1580
 
11.7%
n 1279
 
9.4%
t 1169
 
8.6%
e 783
 
5.8%
m 602
 
4.4%
g 579
 
4.3%
i 579
 
4.3%
: 481
 
3.5%
h 478
 
3.5%
Other values (25) 4104
30.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 7341
54.1%
Other Punctuation 2847
 
21.0%
Space Separator 1580
 
11.7%
Decimal Number 1188
 
8.8%
Open Punctuation 298
 
2.2%
Close Punctuation 298
 
2.2%
Uppercase Letter 6
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 1279
17.4%
t 1169
15.9%
e 783
10.7%
m 602
8.2%
g 579
7.9%
i 579
7.9%
h 478
 
6.5%
s 402
 
5.5%
u 398
 
5.4%
l 294
 
4.0%
Other values (5) 778
10.6%
Decimal Number
ValueCountFrequency (%)
1 288
24.2%
9 128
10.8%
0 112
 
9.4%
7 104
 
8.8%
6 104
 
8.8%
2 102
 
8.6%
3 94
 
7.9%
5 94
 
7.9%
8 91
 
7.7%
4 71
 
6.0%
Other Punctuation
ValueCountFrequency (%)
" 1924
67.6%
: 481
 
16.9%
. 258
 
9.1%
, 184
 
6.5%
Uppercase Letter
ValueCountFrequency (%)
S 2
33.3%
U 2
33.3%
L 2
33.3%
Space Separator
ValueCountFrequency (%)
1580
100.0%
Open Punctuation
ValueCountFrequency (%)
{ 298
100.0%
Close Punctuation
ValueCountFrequency (%)
} 298
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 7347
54.2%
Common 6211
45.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 1279
17.4%
t 1169
15.9%
e 783
10.7%
m 602
8.2%
g 579
7.9%
i 579
7.9%
h 478
 
6.5%
s 402
 
5.5%
u 398
 
5.4%
l 294
 
4.0%
Other values (8) 784
10.7%
Common
ValueCountFrequency (%)
" 1924
31.0%
1580
25.4%
: 481
 
7.7%
{ 298
 
4.8%
} 298
 
4.8%
1 288
 
4.6%
. 258
 
4.2%
, 184
 
3.0%
9 128
 
2.1%
0 112
 
1.8%
Other values (7) 660
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13558
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 1924
14.2%
1580
 
11.7%
n 1279
 
9.4%
t 1169
 
8.6%
e 783
 
5.8%
m 602
 
4.4%
g 579
 
4.3%
i 579
 
4.3%
: 481
 
3.5%
h 478
 
3.5%
Other values (25) 4104
30.3%
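
The dynamicProperties sample rows above are JSON objects whose values are stored as strings. A minimal sketch of parsing one such row with the standard `json` module follows; the helper names `parse_dynamic_properties` and `to_float` are hypothetical, and the key names are taken only from the five sample rows, so other rows may use different keys.

```python
import json

def parse_dynamic_properties(raw):
    """Parse one dynamicProperties string into a dict; return None if empty or invalid."""
    if raw is None or not raw.strip():
        return None
    try:
        parsed = json.loads(raw)
    except json.JSONDecodeError:
        return None
    return parsed if isinstance(parsed, dict) else None

def to_float(value):
    """Convert a string measurement to float; return None when it is not numeric."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None

# First sample row from the report, copied verbatim.
sample = '{"snout vent length in mm":"92.7","weight":"80.4","weight units":"g" }'
props = parse_dynamic_properties(sample)
svl_mm = to_float(props.get("snout vent length in mm"))
weight = to_float(props.get("weight"))
print(svl_mm, weight, props.get("weight units"))
```

Since 98.9% of rows are missing this field, returning `None` rather than raising keeps the helper usable across the whole column.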

occurrenceID
Text

Unique 

Distinct26789
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size209.4 KiB

Length

Max length36
Median length36
Mean length36
Min length36

Characters and Unicode

Total characters964404
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique26789 ?
Unique (%)100.0%

Sample

1st row91f5620f-6148-4707-938a-b2a4ae2c7418
2nd row30380134-d8e4-4799-ae86-3774458e3994
3rd row06a12423-1e1e-483b-8637-27615d54247e
4th rowecd7fc3c-0293-49a2-a315-1c603f586ebe
5th row6581f514-9ead-4ab5-a0ad-06734e5bc1a1
ValueCountFrequency (%)
91f5620f-6148-4707-938a-b2a4ae2c7418 1
 
< 0.1%
9931b7ee-f706-4fca-98bb-eefa52153a20 1
 
< 0.1%
6581f514-9ead-4ab5-a0ad-06734e5bc1a1 1
 
< 0.1%
bf599922-53e6-4d0c-b519-f63121432409 1
 
< 0.1%
d2dec992-d370-4428-8635-f1b407f69f7b 1
 
< 0.1%
6236d7d7-a86e-42fa-8c22-742d6aa13c68 1
 
< 0.1%
51972baf-d3a6-490c-b761-f6c8f28c8783 1
 
< 0.1%
b8df5068-de46-482f-8b48-d367f6a4ec88 1
 
< 0.1%
fc9cd2c9-3860-4de1-aaf1-cc7cdafb2000 1
 
< 0.1%
78b0897b-5e79-4184-8d17-51a9c287ac22 1
 
< 0.1%
Other values (26779) 26779
> 99.9%

Most occurring characters

ValueCountFrequency (%)
4 169486
17.6%
6 115605
12.0%
- 107156
11.1%
1 90783
9.4%
a 90188
9.4%
3 63815
 
6.6%
9 62723
 
6.5%
e 39634
 
4.1%
2 37504
 
3.9%
8 37204
 
3.9%
Other values (7) 150306
15.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 659315
68.4%
Lowercase Letter 197933
 
20.5%
Dash Punctuation 107156
 
11.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 169486
25.7%
6 115605
17.5%
1 90783
13.8%
3 63815
 
9.7%
9 62723
 
9.5%
2 37504
 
5.7%
8 37204
 
5.6%
5 36458
 
5.5%
7 36089
 
5.5%
0 9648
 
1.5%
Lowercase Letter
ValueCountFrequency (%)
a 90188
45.6%
e 39634
20.0%
d 31875
 
16.1%
c 15081
 
7.6%
f 11357
 
5.7%
b 9798
 
5.0%
Dash Punctuation
ValueCountFrequency (%)
- 107156
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 766471
79.5%
Latin 197933
 
20.5%

Most frequent character per script

Common
ValueCountFrequency (%)
4 169486
22.1%
6 115605
15.1%
- 107156
14.0%
1 90783
11.8%
3 63815
 
8.3%
9 62723
 
8.2%
2 37504
 
4.9%
8 37204
 
4.9%
5 36458
 
4.8%
7 36089
 
4.7%
Latin
ValueCountFrequency (%)
a 90188
45.6%
e 39634
20.0%
d 31875
 
16.1%
c 15081
 
7.6%
f 11357
 
5.7%
b 9798
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 964404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 169486
17.6%
6 115605
12.0%
- 107156
11.1%
1 90783
9.4%
a 90188
9.4%
3 63815
 
6.6%
9 62723
 
6.5%
e 39634
 
4.1%
2 37504
 
3.9%
8 37204
 
3.9%
Other values (7) 150306
15.6%

catalogNumber
Text

Unique 

Distinct26789
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size209.4 KiB

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters241101
Distinct characters14
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique26789 ?
Unique (%)100.0%

Sample

1st rowa-0016401
2nd rowa-0016268
3rd rowa-0016264
4th rowa-0016429
5th rowa-0016405
ValueCountFrequency (%)
a-0016401 1
 
< 0.1%
a-0016341 1
 
< 0.1%
a-0016405 1
 
< 0.1%
a-0016305 1
 
< 0.1%
a-0016287 1
 
< 0.1%
a-0016379 1
 
< 0.1%
a-0016432 1
 
< 0.1%
a-0016408 1
 
< 0.1%
a-0016263 1
 
< 0.1%
a-0016404 1
 
< 0.1%
Other values (26779) 26779
> 99.9%

Most occurring characters

ValueCountFrequency (%)
0 82570
34.2%
- 26789
 
11.1%
1 21713
 
9.0%
A 13379
 
5.5%
R 13095
 
5.4%
2 11708
 
4.9%
3 11652
 
4.8%
4 11642
 
4.8%
5 10937
 
4.5%
9 10075
 
4.2%
Other values (4) 27541
 
11.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 187523
77.8%
Dash Punctuation 26789
 
11.1%
Uppercase Letter 26474
 
11.0%
Lowercase Letter 315
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 82570
44.0%
1 21713
 
11.6%
2 11708
 
6.2%
3 11652
 
6.2%
4 11642
 
6.2%
5 10937
 
5.8%
9 10075
 
5.4%
6 9676
 
5.2%
8 9203
 
4.9%
7 8347
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
A 13379
50.5%
R 13095
49.5%
Dash Punctuation
ValueCountFrequency (%)
- 26789
100.0%
Lowercase Letter
ValueCountFrequency (%)
a 315
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 214312
88.9%
Latin 26789
 
11.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 82570
38.5%
- 26789
 
12.5%
1 21713
 
10.1%
2 11708
 
5.5%
3 11652
 
5.4%
4 11642
 
5.4%
5 10937
 
5.1%
9 10075
 
4.7%
6 9676
 
4.5%
8 9203
 
4.3%
Latin
ValueCountFrequency (%)
A 13379
49.9%
R 13095
48.9%
a 315
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 241101
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 82570
34.2%
- 26789
 
11.1%
1 21713
 
9.0%
A 13379
 
5.5%
R 13095
 
5.4%
2 11708
 
4.9%
3 11652
 
4.8%
4 11642
 
4.8%
5 10937
 
4.5%
9 10075
 
4.2%
Other values (4) 27541
 
11.4%
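
The catalogNumber character counts above show one letter per value with mixed case (A 13379, R 13095, a 315), so a case-insensitive comparison may be needed before joining on this field. The sketch below assumes every value has the shape letter, hyphen, seven digits, as suggested by the constant length of 9 and the sample rows. The function name `normalize_catalog_number` is hypothetical, and whether the lowercase prefix is meaningful in the source collection is not stated in the report.

```python
import re

# Assumed shape: one ASCII letter, a hyphen, then exactly seven digits.
CATALOG_RE = re.compile(r"^([A-Za-z])-([0-9]{7})$")

def normalize_catalog_number(value):
    """Return the catalog number with an uppercase prefix, or None if the shape differs."""
    match = CATALOG_RE.match(value.strip())
    if match is None:
        return None
    return f"{match.group(1).upper()}-{match.group(2)}"

print(normalize_catalog_number("a-0016401"))
```

Values that do not match the assumed shape return `None` so they can be reviewed rather than silently rewritten.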

recordNumber
Text

Missing 

Distinct9698
Distinct (%)91.9%
Missing16238
Missing (%)60.6%
Memory size209.4 KiB

Length

Max length32
Median length30
Mean length6.893564591
Min length1

Characters and Unicode

Total characters72734
Distinct characters73
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique9338 ?
Unique (%)88.5%

Sample

1st rowCSM 407
2nd rowCSM 194
3rd rowCSM 198
4th rowCSM 390
5th rowCSM 376
ValueCountFrequency (%)
pr 945
 
4.7%
ahw 700
 
3.5%
fhp 626
 
3.1%
dlr 576
 
2.9%
jmr 564
 
2.8%
hp 481
 
2.4%
kka 422
 
2.1%
kz 324
 
1.6%
csm 312
 
1.5%
jda 304
 
1.5%
Other values (4874) 14890
73.9%

Most occurring characters

ValueCountFrequency (%)
9597
 
13.2%
1 6235
 
8.6%
2 4569
 
6.3%
0 4035
 
5.5%
3 3419
 
4.7%
4 3300
 
4.5%
R 3223
 
4.4%
5 2815
 
3.9%
7 2751
 
3.8%
6 2584
 
3.6%
Other values (63) 30206
41.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 34705
47.7%
Uppercase Letter 26692
36.7%
Space Separator 9597
 
13.2%
Dash Punctuation 871
 
1.2%
Lowercase Letter 566
 
0.8%
Other Punctuation 282
 
0.4%
Close Punctuation 8
 
< 0.1%
Open Punctuation 8
 
< 0.1%
Math Symbol 4
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
R 3223
12.1%
H 2331
 
8.7%
P 2304
 
8.6%
M 2141
 
8.0%
A 2087
 
7.8%
S 1732
 
6.5%
W 1730
 
6.5%
D 1637
 
6.1%
J 1561
 
5.8%
C 1406
 
5.3%
Other values (16) 6540
24.5%
Lowercase Letter
ValueCountFrequency (%)
a 101
17.8%
u 58
10.2%
e 56
9.9%
t 51
9.0%
p 39
 
6.9%
c 33
 
5.8%
o 33
 
5.8%
l 31
 
5.5%
h 27
 
4.8%
z 24
 
4.2%
Other values (12) 113
20.0%
Decimal Number
ValueCountFrequency (%)
1 6235
18.0%
2 4569
13.2%
0 4035
11.6%
3 3419
9.9%
4 3300
9.5%
5 2815
8.1%
7 2751
7.9%
6 2584
7.4%
9 2573
7.4%
8 2424
 
7.0%
Other Punctuation
ValueCountFrequency (%)
. 107
37.9%
, 59
20.9%
# 56
19.9%
& 23
 
8.2%
? 13
 
4.6%
: 9
 
3.2%
; 8
 
2.8%
/ 5
 
1.8%
* 2
 
0.7%
Space Separator
ValueCountFrequency (%)
9597
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 871
100.0%
Close Punctuation
ValueCountFrequency (%)
) 8
100.0%
Open Punctuation
ValueCountFrequency (%)
( 8
100.0%
Math Symbol
ValueCountFrequency (%)
+ 4
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 45476
62.5%
Latin 27258
37.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
R 3223
11.8%
H 2331
 
8.6%
P 2304
 
8.5%
M 2141
 
7.9%
A 2087
 
7.7%
S 1732
 
6.4%
W 1730
 
6.3%
D 1637
 
6.0%
J 1561
 
5.7%
C 1406
 
5.2%
Other values (38) 7106
26.1%
Common
ValueCountFrequency (%)
9597
21.1%
1 6235
13.7%
2 4569
10.0%
0 4035
8.9%
3 3419
 
7.5%
4 3300
 
7.3%
5 2815
 
6.2%
7 2751
 
6.0%
6 2584
 
5.7%
9 2573
 
5.7%
Other values (15) 3598
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 72734
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9597
 
13.2%
1 6235
 
8.6%
2 4569
 
6.3%
0 4035
 
5.5%
3 3419
 
4.7%
4 3300
 
4.5%
R 3223
 
4.4%
5 2815
 
3.9%
7 2751
 
3.8%
6 2584
 
3.6%
Other values (63) 30206
41.5%
Distinct2508
Distinct (%)9.4%
Missing41
Missing (%)0.2%
Memory size209.4 KiB

Length

Max length107
Median length85
Mean length17.5682294
Min length2

Characters and Unicode

Total characters469915
Distinct characters66
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?

Unique

Unique1307 ?
Unique (%)4.9%

Sample

1st rowCinnamon Mittan
2nd rowMayra Oyervides; Chris Mock
3rd rowMayra Oyervides; Chris Mock
4th rowCinnamon Mittan
5th rowCinnamon Mittan
ValueCountFrequency (%)
wright 4137
 
4.9%
albert 3577
 
4.3%
hazen 3570
 
4.3%
j 3557
 
4.2%
unknown 2688
 
3.2%
c 2372
 
2.8%
m 2357
 
2.8%
r 1884
 
2.2%
a 1658
 
2.0%
f 1645
 
2.0%
Other values (1905) 56354
67.2%

Most occurring characters

ValueCountFrequency (%)
57616
 
12.3%
e 35220
 
7.5%
n 30266
 
6.4%
r 27257
 
5.8%
a 26906
 
5.7%
. 25732
 
5.5%
l 24355
 
5.2%
o 21037
 
4.5%
t 19766
 
4.2%
i 19390
 
4.1%
Other values (56) 182370
38.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 298546
63.5%
Uppercase Letter 81190
 
17.3%
Space Separator 57616
 
12.3%
Other Punctuation 32166
 
6.8%
Other Symbol 197
 
< 0.1%
Dash Punctuation 177
 
< 0.1%
Decimal Number 21
 
< 0.1%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 35220
11.8%
n 30266
10.1%
r 27257
 
9.1%
a 26906
 
9.0%
l 24355
 
8.2%
o 21037
 
7.0%
t 19766
 
6.6%
i 19390
 
6.5%
h 11530
 
3.9%
u 10095
 
3.4%
Other values (16) 72724
24.4%
Uppercase Letter
ValueCountFrequency (%)
H 9077
11.2%
A 8717
10.7%
W 7825
 
9.6%
R 6577
 
8.1%
C 5757
 
7.1%
M 5517
 
6.8%
J 5512
 
6.8%
P 3868
 
4.8%
F 3340
 
4.1%
S 3337
 
4.1%
Other values (15) 21663
26.7%
Other Punctuation
ValueCountFrequency (%)
. 25732
80.0%
; 6355
 
19.8%
' 50
 
0.2%
, 27
 
0.1%
? 1
 
< 0.1%
& 1
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
8 10
47.6%
5 5
23.8%
2 3
 
14.3%
3 3
 
14.3%
Space Separator
ValueCountFrequency (%)
57616
100.0%
Other Symbol
ValueCountFrequency (%)
197
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 177
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 379736
80.8%
Common 90179
 
19.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 35220
 
9.3%
n 30266
 
8.0%
r 27257
 
7.2%
a 26906
 
7.1%
l 24355
 
6.4%
o 21037
 
5.5%
t 19766
 
5.2%
i 19390
 
5.1%
h 11530
 
3.0%
u 10095
 
2.7%
Other values (41) 153914
40.5%
Common
ValueCountFrequency (%)
57616
63.9%
. 25732
28.5%
; 6355
 
7.0%
197
 
0.2%
- 177
 
0.2%
' 50
 
0.1%
, 27
 
< 0.1%
8 10
 
< 0.1%
5 5
 
< 0.1%
2 3
 
< 0.1%
Other values (5) 7
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 469718
> 99.9%
Specials 197
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
57616
 
12.3%
e 35220
 
7.5%
n 30266
 
6.4%
r 27257
 
5.8%
a 26906
 
5.7%
. 25732
 
5.5%
l 24355
 
5.2%
o 21037
 
4.5%
t 19766
 
4.2%
i 19390
 
4.1%
Other values (55) 182173
38.8%
Specials
ValueCountFrequency (%)
197
100.0%
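
The variable profiled just above holds person names joined with semicolons, and 197 of its characters fall in the Unicode Specials block under the Other Symbol category. A plausible reading, which the report itself does not confirm, is that these are U+FFFD replacement characters left by an earlier encoding error. The sketch below splits such a value into names and flags values containing U+FFFD; both helper names are hypothetical.

```python
REPLACEMENT_CHAR = "\ufffd"  # U+FFFD, assumed to be the Specials-block character

def split_names(value):
    """Split a semicolon-separated name list into trimmed, non-empty names."""
    return [name.strip() for name in value.split(";") if name.strip()]

def has_replacement_char(value):
    """Return True when the value contains the U+FFFD replacement character."""
    return REPLACEMENT_CHAR in value

# Second sample row from the report, copied verbatim.
names = split_names("Mayra Oyervides; Chris Mock")
print(names)
```

Flagging affected rows first, rather than stripping the character, leaves room to restore the original accented letters from the source records.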

sex
Text

Missing 

Distinct4
Distinct (%)3.8%
Missing26685
Missing (%)99.6%
Memory size209.4 KiB

Length

Max length11
Median length9
Mean length5.403846154
Min length4

Characters and Unicode

Total characters562
Distinct characters15
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowfemale
2nd rowmale
3rd rowmale
4th rowmale
5th rowunknown
ValueCountFrequency (%)
male 52
49.1%
unknown 32
30.2%
female 18
 
17.0%
in 2
 
1.9%
question 2
 
1.9%

Most occurring characters

ValueCountFrequency (%)
n 100
17.8%
e 90
16.0%
m 70
12.5%
a 70
12.5%
l 70
12.5%
u 34
 
6.0%
o 34
 
6.0%
k 32
 
5.7%
w 32
 
5.7%
f 18
 
3.2%
Other values (5) 12
 
2.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 560
99.6%
Space Separator 2
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 100
17.9%
e 90
16.1%
m 70
12.5%
a 70
12.5%
l 70
12.5%
u 34
 
6.1%
o 34
 
6.1%
k 32
 
5.7%
w 32
 
5.7%
f 18
 
3.2%
Other values (4) 10
 
1.8%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 560
99.6%
Common 2
 
0.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 100
17.9%
e 90
16.1%
m 70
12.5%
a 70
12.5%
l 70
12.5%
u 34
 
6.1%
o 34
 
6.1%
k 32
 
5.7%
w 32
 
5.7%
f 18
 
3.2%
Other values (4) 10
 
1.8%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 562
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 100
17.8%
e 90
16.0%
m 70
12.5%
a 70
12.5%
l 70
12.5%
u 34
 
6.0%
o 34
 
6.0%
k 32
 
5.7%
w 32
 
5.7%
f 18
 
3.2%
Other values (5) 12
 
2.1%

lifeStage
Text

Missing 

Distinct28
Distinct (%)0.6%
Missing22252
Missing (%)83.1%
Memory size209.4 KiB

Length

Max length20
Median length5
Mean length6.846815076
Min length3

Characters and Unicode

Total characters31064
Distinct characters22
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique5 ?
Unique (%)0.1%

Sample

1st rownot recorded
2nd rownot recorded
3rd rowjuvenile
4th rowjuvenile
5th rownot recorded
ValueCountFrequency (%)
adult 2239
42.1%
juvenile 822
 
15.5%
larva 707
 
13.3%
not 531
 
10.0%
recorded 531
 
10.0%
egg 220
 
4.1%
metamorph 130
 
2.4%
immature 74
 
1.4%
embryo 34
 
0.6%
hatchling 21
 
0.4%
Other values (2) 6
 
0.1%

Most occurring characters

ValueCountFrequency (%)
a 3884
12.5%
l 3799
12.2%
d 3306
10.6%
e 3176
10.2%
u 3145
10.1%
t 3001
9.7%
r 2007
6.5%
v 1534
 
4.9%
n 1381
 
4.4%
o 1227
 
3.9%
Other values (12) 4604
14.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 30034
96.7%
Space Separator 778
 
2.5%
Other Punctuation 252
 
0.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 3884
12.9%
l 3799
12.6%
d 3306
11.0%
e 3176
10.6%
u 3145
10.5%
t 3001
10.0%
r 2007
6.7%
v 1534
 
5.1%
n 1381
 
4.6%
o 1227
 
4.1%
Other values (9) 3574
11.9%
Other Punctuation
ValueCountFrequency (%)
, 247
98.0%
/ 5
 
2.0%
Space Separator
ValueCountFrequency (%)
778
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 30034
96.7%
Common 1030
 
3.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 3884
12.9%
l 3799
12.6%
d 3306
11.0%
e 3176
10.6%
u 3145
10.5%
t 3001
10.0%
r 2007
6.7%
v 1534
 
5.1%
n 1381
 
4.6%
o 1227
 
4.1%
Other values (9) 3574
11.9%
Common
ValueCountFrequency (%)
778
75.5%
, 247
 
24.0%
/ 5
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 31064
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 3884
12.5%
l 3799
12.2%
d 3306
10.6%
e 3176
10.2%
u 3145
10.1%
t 3001
9.7%
r 2007
6.5%
v 1534
 
4.9%
n 1381
 
4.4%
o 1227
 
3.9%
Other values (12) 4604
14.8%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size209.4 KiB

Length

Max length7
Median length6
Mean length6.010078764
Min length6

Characters and Unicode

Total characters161004
Distinct characters9
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownative
2nd rownative
3rd rownative
4th rownative
5th rownative
ValueCountFrequency (%)
native 26519
99.0%
managed 270
 
1.0%

Most occurring characters

ValueCountFrequency (%)
a 27059
16.8%
n 26789
16.6%
e 26789
16.6%
t 26519
16.5%
i 26519
16.5%
v 26519
16.5%
m 270
 
0.2%
g 270
 
0.2%
d 270
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 161004
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 27059
16.8%
n 26789
16.6%
e 26789
16.6%
t 26519
16.5%
i 26519
16.5%
v 26519
16.5%
m 270
 
0.2%
g 270
 
0.2%
d 270
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 161004
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 27059
16.8%
n 26789
16.6%
e 26789
16.6%
t 26519
16.5%
i 26519
16.5%
v 26519
16.5%
m 270
 
0.2%
g 270
 
0.2%
d 270
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 161004
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 27059
16.8%
n 26789
16.6%
e 26789
16.6%
t 26519
16.5%
i 26519
16.5%
v 26519
16.5%
m 270
 
0.2%
g 270
 
0.2%
d 270
 
0.2%

georeferenceVerificationStatus
Text

Constant  Missing 

Distinct1
Distinct (%)< 0.1%
Missing10270
Missing (%)38.3%
Memory size209.4 KiB

Length

Max length21
Median length21
Mean length21
Min length21

Characters and Unicode

Total characters346899
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowrequires verification
2nd rowrequires verification
3rd rowrequires verification
4th rowrequires verification
5th rowrequires verification
ValueCountFrequency (%)
requires 16519
50.0%
verification 16519
50.0%

Most occurring characters

ValueCountFrequency (%)
i 66076
19.0%
r 49557
14.3%
e 49557
14.3%
q 16519
 
4.8%
u 16519
 
4.8%
s 16519
 
4.8%
16519
 
4.8%
v 16519
 
4.8%
f 16519
 
4.8%
c 16519
 
4.8%
Other values (4) 66076
19.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 330380
95.2%
Space Separator 16519
 
4.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 66076
20.0%
r 49557
15.0%
e 49557
15.0%
q 16519
 
5.0%
u 16519
 
5.0%
s 16519
 
5.0%
v 16519
 
5.0%
f 16519
 
5.0%
c 16519
 
5.0%
a 16519
 
5.0%
Other values (3) 49557
15.0%
Space Separator
ValueCountFrequency (%)
16519
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 330380
95.2%
Common 16519
 
4.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 66076
20.0%
r 49557
15.0%
e 49557
15.0%
q 16519
 
5.0%
u 16519
 
5.0%
s 16519
 
5.0%
v 16519
 
5.0%
f 16519
 
5.0%
c 16519
 
5.0%
a 16519
 
5.0%
Other values (3) 49557
15.0%
Common
ValueCountFrequency (%)
16519
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 346899
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 66076
19.0%
r 49557
14.3%
e 49557
14.3%
q 16519
 
4.8%
u 16519
 
4.8%
s 16519
 
4.8%
16519
 
4.8%
v 16519
 
4.8%
f 16519
 
4.8%
c 16519
 
4.8%
Other values (4) 66076
19.0%

occurrenceStatus
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size209.4 KiB

Length

Max length7
Median length7
Mean length7
Min length7

Characters and Unicode

Total characters187523
Distinct characters6
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowpresent
2nd rowpresent
3rd rowpresent
4th rowpresent
5th rowpresent
ValueCountFrequency (%)
present 26789
100.0%

Most occurring characters

ValueCountFrequency (%)
e 53578
28.6%
p 26789
14.3%
r 26789
14.3%
s 26789
14.3%
n 26789
14.3%
t 26789
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 187523
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 53578
28.6%
p 26789
14.3%
r 26789
14.3%
s 26789
14.3%
n 26789
14.3%
t 26789
14.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 187523
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 53578
28.6%
p 26789
14.3%
r 26789
14.3%
s 26789
14.3%
n 26789
14.3%
t 26789
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 187523
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 53578
28.6%
p 26789
14.3%
r 26789
14.3%
s 26789
14.3%
n 26789
14.3%
t 26789
14.3%
Distinct343
Distinct (%)1.3%
Missing4
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length107
Median length100
Mean length30.45473213
Min length7

Characters and Unicode

Total characters815730
Distinct characters47
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique156 ?
Unique (%)0.6%

Sample

1st rowwhole organism (ethanol) - 1
2nd rowwhole organism (ethanol) - 1
3rd rowwhole organism (ethanol) - 1
4th rowwhole organism (ethanol) - 1
5th rowwhole organism (ethanol) - 1
ValueCountFrequency (%)
27646
20.1%
whole 25878
18.8%
organism 25878
18.8%
1 19253
14.0%
isopropanol 11340
8.2%
unknown 11179
8.1%
ethanol 3626
 
2.6%
0 2920
 
2.1%
2 1892
 
1.4%
tissue 1213
 
0.9%
Other values (123) 7042
 
5.1%

Most occurring characters

ValueCountFrequency (%)
111082
13.6%
o 101805
 
12.5%
n 75865
 
9.3%
a 42389
 
5.2%
l 41794
 
5.1%
s 40221
 
4.9%
i 39788
 
4.9%
r 38495
 
4.7%
w 37057
 
4.5%
e 32248
 
4.0%
Other values (37) 254986
31.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 586920
72.0%
Space Separator 111082
 
13.6%
Decimal Number 30684
 
3.8%
Dash Punctuation 27646
 
3.4%
Close Punctuation 27228
 
3.3%
Open Punctuation 27228
 
3.3%
Uppercase Letter 2760
 
0.3%
Other Punctuation 2182
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 101805
17.3%
n 75865
12.9%
a 42389
 
7.2%
l 41794
 
7.1%
s 40221
 
6.9%
i 39788
 
6.8%
r 38495
 
6.6%
w 37057
 
6.3%
e 32248
 
5.5%
h 29577
 
5.0%
Other values (11) 107681
18.3%
Decimal Number
ValueCountFrequency (%)
1 20397
66.5%
0 3651
 
11.9%
2 2125
 
6.9%
5 1282
 
4.2%
9 1001
 
3.3%
3 900
 
2.9%
4 598
 
1.9%
6 299
 
1.0%
7 220
 
0.7%
8 211
 
0.7%
Uppercase Letter
ValueCountFrequency (%)
D 666
24.1%
A 365
13.2%
E 333
12.1%
T 333
12.1%
O 333
12.1%
S 333
12.1%
M 333
12.1%
R 32
 
1.2%
N 32
 
1.2%
Other Punctuation
ValueCountFrequency (%)
; 1003
46.0%
% 846
38.8%
/ 333
 
15.3%
Space Separator
ValueCountFrequency (%)
111082
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27646
100.0%
Close Punctuation
ValueCountFrequency (%)
) 27228
100.0%
Open Punctuation
ValueCountFrequency (%)
( 27228
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 589680
72.3%
Common 226050
 
27.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 101805
17.3%
n 75865
12.9%
a 42389
 
7.2%
l 41794
 
7.1%
s 40221
 
6.8%
i 39788
 
6.7%
r 38495
 
6.5%
w 37057
 
6.3%
e 32248
 
5.5%
h 29577
 
5.0%
Other values (20) 110441
18.7%
Common
ValueCountFrequency (%)
111082
49.1%
- 27646
 
12.2%
) 27228
 
12.0%
( 27228
 
12.0%
1 20397
 
9.0%
0 3651
 
1.6%
2 2125
 
0.9%
5 1282
 
0.6%
; 1003
 
0.4%
9 1001
 
0.4%
Other values (7) 3407
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 815730
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
111082
13.6%
o 101805
 
12.5%
n 75865
 
9.3%
a 42389
 
5.2%
l 41794
 
5.1%
s 40221
 
4.9%
i 39788
 
4.9%
r 38495
 
4.7%
w 37057
 
4.5%
e 32248
 
4.0%
Other values (37) 254986
31.3%
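
The sample rows of the variable profiled just above follow the pattern `part (medium) - count`, for example `whole organism (ethanol) - 1`, and the character table shows semicolons, which suggests that some values list several items. The sketch below parses that pattern under those assumptions. The function name `parse_preparations` and the output keys are hypothetical, and chunks that do not match are kept as raw text rather than guessed at.

```python
import re

# Assumed item shape inferred from the five sample rows: "part (medium) - count".
PREP_RE = re.compile(
    r"^(?P<part>.+?)\s*\((?P<medium>[^)]*)\)\s*-\s*(?P<count>[0-9]+)$"
)

def parse_preparations(value):
    """Split a value on semicolons and parse each chunk into part, medium, and count."""
    items = []
    for chunk in value.split(";"):
        chunk = chunk.strip()
        if not chunk:
            continue
        match = PREP_RE.match(chunk)
        if match is None:
            # Keep unparsed text so nothing is lost.
            items.append({"raw": chunk})
        else:
            items.append({
                "part": match.group("part"),
                "medium": match.group("medium"),
                "count": int(match.group("count")),
            })
    return items

# First sample row from the report, copied verbatim.
parsed = parse_preparations("whole organism (ethanol) - 1")
print(parsed)
```

With 343 distinct values in this column, a quick check of how many chunks fall into the `raw` fallback would show whether the assumed pattern covers the data well.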

occurrenceRemarks
Text

Missing 

Distinct3599
Distinct (%)37.2%
Missing17106
Missing (%)63.9%
Memory size209.4 KiB

Length

Max length255
Median length211
Mean length38.30021687
Min length1

Characters and Unicode

Total characters370861
Distinct characters88
Distinct categories10 ?
Distinct scripts2 ?
Distinct blocks1 ?

Unique

Unique2978 ?
Unique (%)30.8%

Sample

1st rowfound in collection uncataloged, tag on animal reads 18 May 1956
2nd rowfound uncataloged in collection
3rd rowremoved from CUMV Rept 6022
4th rowremoved from CUMV 6020
5th rowfound uncataloged in Teaching Collection
ValueCountFrequency (%)
specimen 4211
 
7.1%
not 3568
 
6.1%
count 2841
 
4.8%
given 2812
 
4.8%
in 1905
 
3.2%
as 1231
 
2.1%
found 1164
 
2.0%
1014
 
1.7%
kls 818
 
1.4%
originally 784
 
1.3%
Other values (4618) 38567
65.5%

Most occurring characters

ValueCountFrequency (%)
49696
 
13.4%
e 32796
 
8.8%
n 26711
 
7.2%
o 22006
 
5.9%
i 21241
 
5.7%
a 20102
 
5.4%
t 18463
 
5.0%
c 14836
 
4.0%
l 12753
 
3.4%
s 12468
 
3.4%
Other values (78) 139789
37.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 263582
71.1%
Space Separator 49696
 
13.4%
Uppercase Letter 26529
 
7.2%
Decimal Number 17346
 
4.7%
Other Punctuation 11327
 
3.1%
Dash Punctuation 1652
 
0.4%
Open Punctuation 304
 
0.1%
Close Punctuation 303
 
0.1%
Currency Symbol 72
 
< 0.1%
Math Symbol 50
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 32796
12.4%
n 26711
 
10.1%
o 22006
 
8.3%
i 21241
 
8.1%
a 20102
 
7.6%
t 18463
 
7.0%
c 14836
 
5.6%
l 12753
 
4.8%
s 12468
 
4.7%
r 12376
 
4.7%
Other values (16) 69830
26.5%
Uppercase Letter
ValueCountFrequency (%)
S 5820
21.9%
N 1896
 
7.1%
O 1846
 
7.0%
C 1752
 
6.6%
K 1607
 
6.1%
P 1567
 
5.9%
L 1487
 
5.6%
A 1300
 
4.9%
F 1272
 
4.8%
T 974
 
3.7%
Other values (16) 7008
26.4%
Other Punctuation
ValueCountFrequency (%)
. 4797
42.4%
, 3077
27.2%
; 1179
 
10.4%
" 550
 
4.9%
' 515
 
4.5%
: 503
 
4.4%
/ 263
 
2.3%
# 177
 
1.6%
? 118
 
1.0%
& 89
 
0.8%
Other values (4) 59
 
0.5%
Decimal Number
ValueCountFrequency (%)
0 3423
19.7%
1 3090
17.8%
2 2718
15.7%
9 1925
11.1%
7 1773
10.2%
4 1024
 
5.9%
5 949
 
5.5%
3 889
 
5.1%
8 796
 
4.6%
6 759
 
4.4%
Math Symbol
ValueCountFrequency (%)
= 24
48.0%
+ 14
28.0%
~ 6
 
12.0%
> 4
 
8.0%
< 2
 
4.0%
Open Punctuation
ValueCountFrequency (%)
( 299
98.4%
[ 5
 
1.6%
Close Punctuation
ValueCountFrequency (%)
) 298
98.3%
] 5
 
1.7%
Space Separator
ValueCountFrequency (%)
49696
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1652
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 72
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 290111
78.2%
Common 80750
 
21.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 32796
 
11.3%
n 26711
 
9.2%
o 22006
 
7.6%
i 21241
 
7.3%
a 20102
 
6.9%
t 18463
 
6.4%
c 14836
 
5.1%
l 12753
 
4.4%
s 12468
 
4.3%
r 12376
 
4.3%
Other values (42) 96359
33.2%
Common
ValueCountFrequency (%)
49696
61.5%
. 4797
 
5.9%
0 3423
 
4.2%
1 3090
 
3.8%
, 3077
 
3.8%
2 2718
 
3.4%
9 1925
 
2.4%
7 1773
 
2.2%
- 1652
 
2.0%
; 1179
 
1.5%
Other values (26) 7420
 
9.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 370861
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
49696
 
13.4%
e 32796
 
8.8%
n 26711
 
7.2%
o 22006
 
5.9%
i 21241
 
5.7%
a 20102
 
5.4%
t 18463
 
5.0%
c 14836
 
4.0%
l 12753
 
3.4%
s 12468
 
3.4%
Other values (78) 139789
37.7%
Distinct 11791
Distinct (%) 44.4%
Missing 251
Missing (%) 0.9%
Memory size 209.4 KiB

Length

Max length 50
Median length 25
Mean length 11.91287964
Min length 2

Characters and Unicode

Total characters 316144
Distinct characters 73
Distinct categories 10
Distinct scripts 2
Distinct blocks 1

Unique

Unique 8247
Unique (%) 31.1%

Sample

1st row CSM 2016-010
2nd row MGO 2016-03
3rd row MGO 2016-03
4th row CSM 2016-008
5th row CSM 2016-008
ValueCountFrequency (%)
ahw 3758
 
7.2%
no 1409
 
2.7%
data 1409
 
2.7%
fhp 1149
 
2.2%
cupre 577
 
1.1%
dlr 576
 
1.1%
jmr 560
 
1.1%
hha 479
 
0.9%
wjh 422
 
0.8%
wjm 410
 
0.8%
Other values (10689) 41598
79.5%

Most occurring characters

ValueCountFrequency (%)
1 36538
 
11.6%
25815
 
8.2%
9 25613
 
8.1%
A 20294
 
6.4%
2 16295
 
5.2%
0 14486
 
4.6%
J 13466
 
4.3%
U 11950
 
3.8%
R 9590
 
3.0%
H 9013
 
2.9%
Other values (63) 133084
42.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 146612
46.4%
Decimal Number 137402
43.5%
Space Separator 25815
 
8.2%
Lowercase Letter 3153
 
1.0%
Dash Punctuation 2180
 
0.7%
Other Punctuation 822
 
0.3%
Close Punctuation 77
 
< 0.1%
Open Punctuation 77
 
< 0.1%
Connector Punctuation 3
 
< 0.1%
Modifier Symbol 3
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
A 20294
13.8%
J 13466
 
9.2%
U 11950
 
8.2%
R 9590
 
6.5%
H 9013
 
6.1%
M 8065
 
5.5%
P 7465
 
5.1%
N 7124
 
4.9%
W 6728
 
4.6%
L 6663
 
4.5%
Other values (16) 46254
31.5%
Lowercase Letter
ValueCountFrequency (%)
e 410
13.0%
o 319
10.1%
a 289
 
9.2%
i 275
 
8.7%
l 240
 
7.6%
r 209
 
6.6%
n 172
 
5.5%
s 162
 
5.1%
h 161
 
5.1%
d 149
 
4.7%
Other values (15) 767
24.3%
Decimal Number
ValueCountFrequency (%)
1 36538
26.6%
9 25613
18.6%
2 16295
11.9%
0 14486
 
10.5%
3 8388
 
6.1%
4 8169
 
5.9%
7 8149
 
5.9%
5 7788
 
5.7%
8 6737
 
4.9%
6 5239
 
3.8%
Other Punctuation
ValueCountFrequency (%)
? 799
97.2%
& 12
 
1.5%
/ 7
 
0.9%
. 2
 
0.2%
' 1
 
0.1%
, 1
 
0.1%
Space Separator
ValueCountFrequency (%)
25815
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2180
100.0%
Close Punctuation
ValueCountFrequency (%)
) 77
100.0%
Open Punctuation
ValueCountFrequency (%)
( 77
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 166379
52.6%
Latin 149765
47.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 20294
13.6%
J 13466
 
9.0%
U 11950
 
8.0%
R 9590
 
6.4%
H 9013
 
6.0%
M 8065
 
5.4%
P 7465
 
5.0%
N 7124
 
4.8%
W 6728
 
4.5%
L 6663
 
4.4%
Other values (41) 49407
33.0%
Common
ValueCountFrequency (%)
1 36538
22.0%
25815
15.5%
9 25613
15.4%
2 16295
9.8%
0 14486
 
8.7%
3 8388
 
5.0%
4 8169
 
4.9%
7 8149
 
4.9%
5 7788
 
4.7%
8 6737
 
4.0%
Other values (12) 8401
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 316144
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 36538
 
11.6%
25815
 
8.2%
9 25613
 
8.1%
A 20294
 
6.4%
2 16295
 
5.2%
0 14486
 
4.6%
J 13466
 
4.3%
U 11950
 
3.8%
R 9590
 
3.0%
H 9013
 
2.9%
Other values (63) 133084
42.1%

eventDate
Text

Missing 

Distinct 6488
Distinct (%) 25.9%
Missing 1766
Missing (%) 6.6%
Memory size 209.4 KiB

Length

Max length 10
Median length 10
Mean length 9.554849538
Min length 4

Characters and Unicode

Total characters 239091
Distinct characters 11
Distinct categories 2
Distinct scripts 1
Distinct blocks 1

Unique

Unique 3025
Unique (%) 12.1%

Sample

1st row 2016-08-10
2nd row 2016-08-03
3rd row 2016-08-03
4th row 2016-07-24
5th row 2016-07-23
ValueCountFrequency (%)
1860-01-01 1769
 
7.1%
2004-10-27 190
 
0.8%
2004-12-09 150
 
0.6%
2004-11-09 109
 
0.4%
2008 92
 
0.4%
1912-05-25 79
 
0.3%
1912-06-24 70
 
0.3%
1917-06-01 68
 
0.3%
1912-07-13 67
 
0.3%
2004-11-25 59
 
0.2%
Other values (6478) 22370
89.4%

Most occurring characters

ValueCountFrequency (%)
- 46333
19.4%
1 43839
18.3%
0 41353
17.3%
9 27128
11.3%
2 16866
 
7.1%
7 12387
 
5.2%
6 10802
 
4.5%
8 10506
 
4.4%
4 10229
 
4.3%
5 10198
 
4.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 192758
80.6%
Dash Punctuation 46333
 
19.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 43839
22.7%
0 41353
21.5%
9 27128
14.1%
2 16866
 
8.7%
7 12387
 
6.4%
6 10802
 
5.6%
8 10506
 
5.5%
4 10229
 
5.3%
5 10198
 
5.3%
3 9450
 
4.9%
Dash Punctuation
ValueCountFrequency (%)
- 46333
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 239091
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 46333
19.4%
1 43839
18.3%
0 41353
17.3%
9 27128
11.3%
2 16866
 
7.1%
7 12387
 
5.2%
6 10802
 
4.5%
8 10506
 
4.4%
4 10229
 
4.3%
5 10198
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 239091
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 46333
19.4%
1 43839
18.3%
0 41353
17.3%
9 27128
11.3%
2 16866
 
7.1%
7 12387
 
5.2%
6 10802
 
4.5%
8 10506
 
4.4%
4 10229
 
4.3%
5 10198
 
4.3%

startDayOfYear
Text

Missing 

Distinct 365
Distinct (%) 1.6%
Missing 4508
Missing (%) 16.8%
Memory size 209.4 KiB

Length

Max length 3
Median length 3
Mean length 2.677976751
Min length 1

Characters and Unicode

Total characters 59668
Distinct characters 10
Distinct categories 1
Distinct scripts 1
Distinct blocks 1

Unique

Unique 7
Unique (%) < 0.1%

Sample

1st row 223
2nd row 216
3rd row 216
4th row 206
5th row 205
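
The five sample values above line up with the five eventDate samples (2016-08-10 through 2016-07-23) when each date is converted to its ordinal day of the year, which suggests, though the report does not state it, that startDayOfYear is derived from eventDate. A minimal sketch of that conversion, assuming full `YYYY-MM-DD` strings and using a hypothetical helper name `day_of_year`:

```python
from datetime import datetime


def day_of_year(iso_date):
    """Return the 1-based day of the year for a YYYY-MM-DD string.

    Returns None for missing values and for partial dates such as "2008",
    which carry no day information.
    """
    try:
        return datetime.strptime(iso_date, "%Y-%m-%d").timetuple().tm_yday
    except (TypeError, ValueError):
        return None


# The eventDate sample rows, followed by a year-only value.
for value in ["2016-08-10", "2016-08-03", "2016-07-24", "2016-07-23", "2008"]:
    print(value, day_of_year(value))
```

Year-only values return `None` here, which would be consistent with startDayOfYear having more missing values than eventDate, but that reading is an inference rather than something the report confirms.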
ValueCountFrequency (%)
1 1781
 
8.0%
195 262
 
1.2%
167 245
 
1.1%
156 211
 
0.9%
301 206
 
0.9%
200 201
 
0.9%
191 181
 
0.8%
190 180
 
0.8%
184 177
 
0.8%
189 176
 
0.8%
Other values (355) 18661
83.8%

Most occurring characters

ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 59668
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring scripts

ValueCountFrequency (%)
Common 59668
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 59668
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

endDayOfYear
Text

Missing 

Distinct 365
Distinct (%) 1.6%
Missing 4508
Missing (%) 16.8%
Memory size 209.4 KiB

Length

Max length 3
Median length 3
Mean length 2.677976751
Min length 1

Characters and Unicode

Total characters 59668
Distinct characters 10
Distinct categories 1
Distinct scripts 1
Distinct blocks 1

Unique

Unique 7
Unique (%) < 0.1%

Sample

1st row 223
2nd row 216
3rd row 216
4th row 206
5th row 205
ValueCountFrequency (%)
1 1781
 
8.0%
195 262
 
1.2%
167 245
 
1.1%
156 211
 
0.9%
301 206
 
0.9%
200 201
 
0.9%
191 181
 
0.8%
190 180
 
0.8%
184 177
 
0.8%
189 176
 
0.8%
Other values (355) 18661
83.8%

Most occurring characters

ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 59668
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring scripts

ValueCountFrequency (%)
Common 59668
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 59668
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 16138
27.0%
2 9070
15.2%
3 5095
 
8.5%
9 4696
 
7.9%
0 4616
 
7.7%
6 4299
 
7.2%
8 4136
 
6.9%
5 4004
 
6.7%
7 3821
 
6.4%
4 3793
 
6.4%

year
Text

Missing 

Distinct 147
Distinct (%) 0.6%
Missing 1766
Missing (%) 6.6%
Memory size 209.4 KiB

Length

Max length 4
Median length 4
Mean length 4
Min length 4

Characters and Unicode

Total characters 100092
Distinct characters 10
Distinct categories 1
Distinct scripts 1
Distinct blocks 1

Unique

Unique 2
Unique (%) < 0.1%

Sample

1st row 2016
2nd row 2016
3rd row 2016
4th row 2016
5th row 2016
ValueCountFrequency (%)
1860 1775
 
7.1%
1921 970
 
3.9%
2004 743
 
3.0%
1917 656
 
2.6%
1940 636
 
2.5%
1925 617
 
2.5%
1912 616
 
2.5%
1942 560
 
2.2%
1978 552
 
2.2%
1976 516
 
2.1%
Other values (137) 17382
69.5%

Most occurring characters

ValueCountFrequency (%)
1 27660
27.6%
9 23795
23.8%
0 8356
 
8.3%
2 7295
 
7.3%
8 6151
 
6.1%
7 5900
 
5.9%
5 5663
 
5.7%
4 5537
 
5.5%
3 4979
 
5.0%
6 4756
 
4.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 100092
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 27660
27.6%
9 23795
23.8%
0 8356
 
8.3%
2 7295
 
7.3%
8 6151
 
6.1%
7 5900
 
5.9%
5 5663
 
5.7%
4 5537
 
5.5%
3 4979
 
5.0%
6 4756
 
4.8%

Most occurring scripts

ValueCountFrequency (%)
Common 100092
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 27660
27.6%
9 23795
23.8%
0 8356
 
8.3%
2 7295
 
7.3%
8 6151
 
6.1%
7 5900
 
5.9%
5 5663
 
5.7%
4 5537
 
5.5%
3 4979
 
5.0%
6 4756
 
4.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 100092
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 27660
27.6%
9 23795
23.8%
0 8356
 
8.3%
2 7295
 
7.3%
8 6151
 
6.1%
7 5900
 
5.9%
5 5663
 
5.7%
4 5537
 
5.5%
3 4979
 
5.0%
6 4756
 
4.8%

month
Text

Missing 

Distinct 12
Distinct (%) < 0.1%
Missing 2737
Missing (%) 10.2%
Memory size 209.4 KiB

Length

Max length 2
Median length 1
Mean length 1.093339431
Min length 1

Characters and Unicode

Total characters 26297
Distinct characters 10
Distinct categories 1
Distinct scripts 1
Distinct blocks 1

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row 8
2nd row 8
3rd row 8
4th row 7
5th row 7
ValueCountFrequency (%)
7 4547
18.9%
6 4206
17.5%
4 2558
10.6%
8 2495
10.4%
1 2393
9.9%
5 2337
9.7%
3 1582
 
6.6%
9 1153
 
4.8%
10 1104
 
4.6%
11 668
 
2.8%
Other values (2) 1009
 
4.2%

Most occurring characters

ValueCountFrequency (%)
1 5306
20.2%
7 4547
17.3%
6 4206
16.0%
4 2558
9.7%
8 2495
9.5%
5 2337
8.9%
3 1582
 
6.0%
9 1153
 
4.4%
0 1104
 
4.2%
2 1009
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 26297
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5306
20.2%
7 4547
17.3%
6 4206
16.0%
4 2558
9.7%
8 2495
9.5%
5 2337
8.9%
3 1582
 
6.0%
9 1153
 
4.4%
0 1104
 
4.2%
2 1009
 
3.8%

Most occurring scripts

ValueCountFrequency (%)
Common 26297
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5306
20.2%
7 4547
17.3%
6 4206
16.0%
4 2558
9.7%
8 2495
9.5%
5 2337
8.9%
3 1582
 
6.0%
9 1153
 
4.4%
0 1104
 
4.2%
2 1009
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 26297
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5306
20.2%
7 4547
17.3%
6 4206
16.0%
4 2558
9.7%
8 2495
9.5%
5 2337
8.9%
3 1582
 
6.0%
9 1153
 
4.4%
0 1104
 
4.2%
2 1009
 
3.8%

day
Text

Missing 

Distinct 31
Distinct (%) 0.1%
Missing 4508
Missing (%) 16.8%
Memory size 209.4 KiB

Length

Max length 2
Median length 2
Mean length 1.641533145
Min length 1

Characters and Unicode

Total characters 36575
Distinct characters 10
Distinct categories 1
Distinct scripts 1
Distinct blocks 1

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row 10
2nd row 3
3rd row 3
4th row 24
5th row 23
ValueCountFrequency (%)
1 2620
 
11.8%
9 903
 
4.1%
5 767
 
3.4%
24 764
 
3.4%
20 763
 
3.4%
12 756
 
3.4%
27 753
 
3.4%
16 753
 
3.4%
25 732
 
3.3%
18 713
 
3.2%
Other values (21) 12757
57.3%

Most occurring characters

ValueCountFrequency (%)
1 10873
29.7%
2 8562
23.4%
3 2889
 
7.9%
5 2198
 
6.0%
9 2180
 
6.0%
4 2134
 
5.8%
0 2099
 
5.7%
7 1940
 
5.3%
8 1860
 
5.1%
6 1840
 
5.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 36575
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 10873
29.7%
2 8562
23.4%
3 2889
 
7.9%
5 2198
 
6.0%
9 2180
 
6.0%
4 2134
 
5.8%
0 2099
 
5.7%
7 1940
 
5.3%
8 1860
 
5.1%
6 1840
 
5.0%

Most occurring scripts

ValueCountFrequency (%)
Common 36575
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 10873
29.7%
2 8562
23.4%
3 2889
 
7.9%
5 2198
 
6.0%
9 2180
 
6.0%
4 2134
 
5.8%
0 2099
 
5.7%
7 1940
 
5.3%
8 1860
 
5.1%
6 1840
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 36575
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 10873
29.7%
2 8562
23.4%
3 2889
 
7.9%
5 2198
 
6.0%
9 2180
 
6.0%
4 2134
 
5.8%
0 2099
 
5.7%
7 1940
 
5.3%
8 1860
 
5.1%
6 1840
 
5.0%

verbatimEventDate
Text

Missing 

Distinct 7593
Distinct (%) 28.7%
Missing 322
Missing (%) 1.2%
Memory size 209.4 KiB

Length

Max length 33
Median length 30
Mean length 9.591680206
Min length 4

Characters and Unicode

Total characters 253863
Distinct characters 53
Distinct categories 8
Distinct scripts 2
Distinct blocks 1

Unique

Unique 4042
Unique (%) 15.3%

Sample

1st row 4 January 2015
2nd row 23 June 2011
3rd row 4 January 2015
4th row 4 November 2014
5th row 4 January 2015
ValueCountFrequency (%)
0/0/0 2431
 
9.0%
0---2014-01-01 579
 
2.2%
2004-10-27-2004-10-31 190
 
0.7%
2004-12-09-2004-12-31 139
 
0.5%
2004-11-09-2004-11-20 96
 
0.4%
2008 92
 
0.3%
1917-06-01 68
 
0.3%
4 67
 
0.2%
1912-06-24 66
 
0.2%
2004-11-25-2004-11-30 59
 
0.2%
Other values (7598) 23104
85.9%

Most occurring characters

ValueCountFrequency (%)
1 44829
17.7%
0 35125
13.8%
/ 34128
13.4%
9 28235
11.1%
- 23001
9.1%
2 20110
7.9%
7 12378
 
4.9%
4 11696
 
4.6%
5 10655
 
4.2%
3 10195
 
4.0%
Other values (43) 23511
9.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 191198
75.3%
Other Punctuation 34867
 
13.7%
Dash Punctuation 23001
 
9.1%
Lowercase Letter 2681
 
1.1%
Uppercase Letter 1688
 
0.7%
Space Separator 424
 
0.2%
Open Punctuation 2
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 525
19.6%
r 289
10.8%
a 280
10.4%
e 239
8.9%
l 221
8.2%
n 207
 
7.7%
g 148
 
5.5%
y 139
 
5.2%
p 134
 
5.0%
c 98
 
3.7%
Other values (12) 401
15.0%
Uppercase Letter
ValueCountFrequency (%)
T 727
43.1%
J 397
23.5%
A 226
 
13.4%
M 138
 
8.2%
O 70
 
4.1%
S 53
 
3.1%
N 44
 
2.6%
F 18
 
1.1%
D 11
 
0.7%
L 3
 
0.2%
Decimal Number
ValueCountFrequency (%)
1 44829
23.4%
0 35125
18.4%
9 28235
14.8%
2 20110
10.5%
7 12378
 
6.5%
4 11696
 
6.1%
5 10655
 
5.6%
3 10195
 
5.3%
8 9029
 
4.7%
6 8946
 
4.7%
Other Punctuation
ValueCountFrequency (%)
/ 34128
97.9%
: 718
 
2.1%
' 12
 
< 0.1%
? 7
 
< 0.1%
, 1
 
< 0.1%
& 1
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 23001
100.0%
Space Separator
ValueCountFrequency (%)
424
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 249494
98.3%
Latin 4369
 
1.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
T 727
16.6%
u 525
12.0%
J 397
 
9.1%
r 289
 
6.6%
a 280
 
6.4%
e 239
 
5.5%
A 226
 
5.2%
l 221
 
5.1%
n 207
 
4.7%
g 148
 
3.4%
Other values (23) 1110
25.4%
Common
ValueCountFrequency (%)
1 44829
18.0%
0 35125
14.1%
/ 34128
13.7%
9 28235
11.3%
- 23001
9.2%
2 20110
8.1%
7 12378
 
5.0%
4 11696
 
4.7%
5 10655
 
4.3%
3 10195
 
4.1%
Other values (10) 19142
7.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 253863
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 44829
17.7%
0 35125
13.8%
/ 34128
13.4%
9 28235
11.1%
- 23001
9.1%
2 20110
7.9%
7 12378
 
4.9%
4 11696
 
4.6%
5 10655
 
4.2%
3 10195
 
4.0%
Other values (43) 23511
9.3%

habitat
Text

Missing 

Distinct 3
Distinct (%) 3.2%
Missing 26694
Missing (%) 99.6%
Memory size 209.4 KiB

Length

Max length 51
Median length 5
Mean length 5.505263158
Min length 5

Characters and Unicode

Total characters 523
Distinct characters 19
Distinct categories 2
Distinct scripts 2
Distinct blocks 1

Unique

Unique 1
Unique (%) 1.1%

Sample

1st row urban
2nd row urban
3rd row urban
4th row urban
5th row urban
ValueCountFrequency (%)
urban 92
91.1%
desert 2
 
2.0%
small 1
 
1.0%
stream 1
 
1.0%
floodplain 1
 
1.0%
swamp 1
 
1.0%
and 1
 
1.0%
agricultural 1
 
1.0%
land 1
 
1.0%

Most occurring characters

ValueCountFrequency (%)
a 100
19.1%
r 97
18.5%
n 95
18.2%
u 94
18.0%
b 92
17.6%
l 7
 
1.3%
6
 
1.1%
d 5
 
1.0%
e 5
 
1.0%
s 5
 
1.0%
Other values (9) 17
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 517
98.9%
Space Separator 6
 
1.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 100
19.3%
r 97
18.8%
n 95
18.4%
u 94
18.2%
b 92
17.8%
l 7
 
1.4%
d 5
 
1.0%
e 5
 
1.0%
s 5
 
1.0%
t 4
 
0.8%
Other values (8) 13
 
2.5%
Space Separator
ValueCountFrequency (%)
6
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 517
98.9%
Common 6
 
1.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 100
19.3%
r 97
18.8%
n 95
18.4%
u 94
18.2%
b 92
17.8%
l 7
 
1.4%
d 5
 
1.0%
e 5
 
1.0%
s 5
 
1.0%
t 4
 
0.8%
Other values (8) 13
 
2.5%
Common
ValueCountFrequency (%)
6
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 523
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 100
19.1%
r 97
18.5%
n 95
18.2%
u 94
18.0%
b 92
17.6%
l 7
 
1.3%
6
 
1.1%
d 5
 
1.0%
e 5
 
1.0%
s 5
 
1.0%
Other values (9) 17
 
3.3%

eventRemarks
Text

Missing 

Distinct 864
Distinct (%) 36.4%
Missing 24414
Missing (%) 91.1%
Memory size 209.4 KiB

Length

Max length 255
Median length 114
Mean length 37.23663158
Min length 3

Characters and Unicode

Total characters 88437
Distinct characters 85
Distinct categories 11
Distinct scripts 2
Distinct blocks 2

Unique

Unique 583
Unique (%) 24.5%

Sample

1st row College Court Apartments
2nd row Toads collected on public golf course on US1
3rd row Toads collected on public golf course on US1
4th row Collected at Tampa Golfcourse (Rogers Park)
5th row Toads collected on public golf course on US1
ValueCountFrequency (%)
in 538
 
3.6%
collected 412
 
2.7%
of 387
 
2.6%
utm 304
 
2.0%
from 280
 
1.9%
on 191
 
1.3%
to 161
 
1.1%
at 154
 
1.0%
by 136
 
0.9%
133
 
0.9%
Other values (1834) 12405
82.1%

Most occurring characters

ValueCountFrequency (%)
12832
 
14.5%
e 7006
 
7.9%
a 4947
 
5.6%
o 4721
 
5.3%
t 4172
 
4.7%
l 4018
 
4.5%
n 3978
 
4.5%
i 3838
 
4.3%
r 3761
 
4.3%
s 3305
 
3.7%
Other values (75) 35859
40.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 57036
64.5%
Space Separator 12832
 
14.5%
Decimal Number 9011
 
10.2%
Uppercase Letter 6262
 
7.1%
Other Punctuation 2930
 
3.3%
Dash Punctuation 198
 
0.2%
Close Punctuation 74
 
0.1%
Open Punctuation 69
 
0.1%
Math Symbol 17
 
< 0.1%
Other Symbol 7
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 7006
12.3%
a 4947
 
8.7%
o 4721
 
8.3%
t 4172
 
7.3%
l 4018
 
7.0%
n 3978
 
7.0%
i 3838
 
6.7%
r 3761
 
6.6%
s 3305
 
5.8%
d 2875
 
5.0%
Other values (16) 14415
25.3%
Uppercase Letter
ValueCountFrequency (%)
T 680
 
10.9%
S 596
 
9.5%
M 558
 
8.9%
U 482
 
7.7%
C 478
 
7.6%
L 384
 
6.1%
R 370
 
5.9%
P 297
 
4.7%
N 274
 
4.4%
A 260
 
4.2%
Other values (16) 1883
30.1%
Other Punctuation
ValueCountFrequency (%)
. 1104
37.7%
, 671
22.9%
' 342
 
11.7%
; 221
 
7.5%
: 158
 
5.4%
# 140
 
4.8%
" 127
 
4.3%
/ 52
 
1.8%
& 46
 
1.6%
* 43
 
1.5%
Other values (3) 26
 
0.9%
Decimal Number
ValueCountFrequency (%)
1 1520
16.9%
0 1074
11.9%
4 961
10.7%
2 932
10.3%
9 920
10.2%
5 855
9.5%
7 791
8.8%
6 788
8.7%
8 649
7.2%
3 521
 
5.8%
Math Symbol
ValueCountFrequency (%)
~ 10
58.8%
< 4
 
23.5%
= 2
 
11.8%
> 1
 
5.9%
Space Separator
ValueCountFrequency (%)
12832
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 198
100.0%
Close Punctuation
ValueCountFrequency (%)
) 74
100.0%
Open Punctuation
ValueCountFrequency (%)
( 69
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 63298
71.6%
Common 25139
 
28.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 7006
 
11.1%
a 4947
 
7.8%
o 4721
 
7.5%
t 4172
 
6.6%
l 4018
 
6.3%
n 3978
 
6.3%
i 3838
 
6.1%
r 3761
 
5.9%
s 3305
 
5.2%
d 2875
 
4.5%
Other values (42) 20677
32.7%
Common
ValueCountFrequency (%)
12832
51.0%
1 1520
 
6.0%
. 1104
 
4.4%
0 1074
 
4.3%
4 961
 
3.8%
2 932
 
3.7%
9 920
 
3.7%
5 855
 
3.4%
7 791
 
3.1%
6 788
 
3.1%
Other values (23) 3362
 
13.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 88430
> 99.9%
Specials 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
12832
 
14.5%
e 7006
 
7.9%
a 4947
 
5.6%
o 4721
 
5.3%
t 4172
 
4.7%
l 4018
 
4.5%
n 3978
 
4.5%
i 3838
 
4.3%
r 3761
 
4.3%
s 3305
 
3.7%
Other values (74) 35852
40.5%
Specials
ValueCountFrequency (%)
7
100.0%
Distinct 1398
Distinct (%) 5.2%
Missing 1
Missing (%) < 0.1%
Memory size 209.4 KiB

Length

Max length 118
Median length 93
Mean length 63.58003584
Min length 24

Characters and Unicode

Total characters 1703182
Distinct characters 56
Distinct categories 6
Distinct scripts 2
Distinct blocks 1

Unique

Unique 384
Unique (%) 1.4%

Sample

1st row North America | United States | Florida | Hillsborough County | | | |
2nd row North America | United States | Texas | Hidalgo County | | | |
3rd row North America | United States | Texas | Hidalgo County | | | |
4th row North America | United States | Florida | Lee County | | | |
5th row North America | United States | Florida | Lee County | | | |
ValueCountFrequency (%)
187516
51.6%
america 20715
 
5.7%
north 20330
 
5.6%
united 20166
 
5.5%
states 20080
 
5.5%
county 16818
 
4.6%
new 4868
 
1.3%
york 3880
 
1.1%
australia 3687
 
1.0%
georgia 2839
 
0.8%
Other values (1201) 62582
 
17.2%

Most occurring characters

ValueCountFrequency (%)
428932
25.2%
| 187516
11.0%
t 119172
 
7.0%
e 109365
 
6.4%
a 92263
 
5.4%
r 84694
 
5.0%
i 80356
 
4.7%
o 78348
 
4.6%
n 67292
 
4.0%
s 44314
 
2.6%
Other values (46) 410930
24.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 911162
53.5%
Space Separator 428932
25.2%
Math Symbol 187516
 
11.0%
Uppercase Letter 175031
 
10.3%
Dash Punctuation 457
 
< 0.1%
Other Punctuation 84
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 119172
13.1%
e 109365
12.0%
a 92263
10.1%
r 84694
9.3%
i 80356
8.8%
o 78348
8.6%
n 67292
 
7.4%
s 44314
 
4.9%
h 33966
 
3.7%
d 31768
 
3.5%
Other values (16) 169624
18.6%
Uppercase Letter
ValueCountFrequency (%)
A 30129
17.2%
C 26181
15.0%
N 26030
14.9%
S 22903
13.1%
U 20576
11.8%
G 6097
 
3.5%
T 4743
 
2.7%
W 4360
 
2.5%
P 4311
 
2.5%
Y 3993
 
2.3%
Other values (16) 25708
14.7%
Space Separator
ValueCountFrequency (%)
428932
100.0%
Math Symbol
ValueCountFrequency (%)
| 187516
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 457
100.0%
Other Punctuation
ValueCountFrequency (%)
' 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1086193
63.8%
Common 616989
36.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 119172
 
11.0%
e 109365
 
10.1%
a 92263
 
8.5%
r 84694
 
7.8%
i 80356
 
7.4%
o 78348
 
7.2%
n 67292
 
6.2%
s 44314
 
4.1%
h 33966
 
3.1%
d 31768
 
2.9%
Other values (42) 344655
31.7%
Common
ValueCountFrequency (%)
428932
69.5%
| 187516
30.4%
- 457
 
0.1%
' 84
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1703182
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
428932
25.2%
| 187516
11.0%
t 119172
 
7.0%
e 109365
 
6.4%
a 92263
 
5.4%
r 84694
 
5.0%
i 80356
 
4.7%
o 78348
 
4.6%
n 67292
 
4.0%
s 44314
 
2.6%
Other values (46) 410930
24.1%

continent
Text

Missing 

Distinct 7
Distinct (%) < 0.1%
Missing 1793
Missing (%) 6.7%
Memory size 209.4 KiB

Length

Max length 13
Median length 13
Mean length 12.51088174
Min length 4

Characters and Unicode

Total characters 312722
Distinct characters 21
Distinct categories 3
Distinct scripts 2
Distinct blocks 1

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row North America
2nd row North America
3rd row North America
4th row North America
5th row North America
ValueCountFrequency (%)
america 22413
47.3%
north 22156
46.7%
australasia 1262
 
2.7%
africa 772
 
1.6%
south 257
 
0.5%
asia 254
 
0.5%
europe 242
 
0.5%
oceania 53
 
0.1%

Most occurring characters

ValueCountFrequency (%)
r 46845
15.0%
a 27331
8.7%
i 24754
7.9%
A 24701
7.9%
t 23675
7.6%
c 23238
7.4%
e 22708
7.3%
o 22655
7.2%
h 22413
7.2%
22413
7.2%
Other values (11) 51989
16.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 242900
77.7%
Uppercase Letter 47409
 
15.2%
Space Separator 22413
 
7.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 46845
19.3%
a 27331
11.3%
i 24754
10.2%
t 23675
9.7%
c 23238
9.6%
e 22708
9.3%
o 22655
9.3%
h 22413
9.2%
m 22413
9.2%
s 2778
 
1.1%
Other values (5) 4090
 
1.7%
Uppercase Letter
ValueCountFrequency (%)
A 24701
52.1%
N 22156
46.7%
S 257
 
0.5%
E 242
 
0.5%
O 53
 
0.1%
Space Separator
ValueCountFrequency (%)
22413
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 290309
92.8%
Common 22413
 
7.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 46845
16.1%
a 27331
9.4%
i 24754
8.5%
A 24701
8.5%
t 23675
8.2%
c 23238
8.0%
e 22708
7.8%
o 22655
7.8%
h 22413
7.7%
m 22413
7.7%
Other values (10) 29576
10.2%
Common
ValueCountFrequency (%)
22413
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 312722
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 46845
15.0%
a 27331
8.7%
i 24754
7.9%
A 24701
7.9%
t 23675
7.6%
c 23238
7.4%
e 22708
7.3%
o 22655
7.2%
h 22413
7.2%
22413
7.2%
Other values (11) 51989
16.6%

waterBody
Text

Missing 

Distinct2
Distinct (%)14.3%
Missing26775
Missing (%)99.9%
Memory size209.4 KiB
2025-01-23T18:14:35.334382image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length19
Mean length16.42857143
Min length13

Characters and Unicode

Total characters230
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0
Unique (%)0.0%

Sample

1st rowSouth Pacific Ocean
2nd rowSouth Pacific Ocean
3rd rowSouth Pacific Ocean
4th rowPacific Ocean
5th rowSouth Pacific Ocean
Value               Count   Frequency (%)
pacific             14      38.9%
ocean               14      38.9%
south               8       22.2%
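
The word table above is consistent with lowercasing each value and splitting it on whitespace. The sketch below shows that approach; the tokenisation rule is an assumption inferred from the output, the 8/6 split of values is inferred from the counts, and `word_frequencies` is a hypothetical helper.

```python
from collections import Counter


def word_frequencies(values):
    """Return (word, count, percent) rows, most frequent first."""
    words = Counter()
    for value in values:
        # Assumed tokenisation: lowercase, then split on whitespace.
        words.update(value.lower().split())
    total = sum(words.values())
    return [(w, n, round(100 * n / total, 1)) for w, n in words.most_common()]


water_body = ["South Pacific Ocean"] * 8 + ["Pacific Ocean"] * 6
rows = word_frequencies(water_body)
for word, count, pct in rows:
    print(word, count, f"{pct}%")
```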

Most occurring characters

Value               Count   Frequency (%)
c                   42      18.3%
a                   28      12.2%
i                   28      12.2%
(space)             22      9.6%
P                   14      6.1%
f                   14      6.1%
O                   14      6.1%
e                   14      6.1%
n                   14      6.1%
S                   8       3.5%
Other values (4)    32      13.9%

Most occurring categories

Value               Count   Frequency (%)
Lowercase Letter    172     74.8%
Uppercase Letter    36      15.7%
Space Separator     22      9.6%

Most frequent character per category

Lowercase Letter
Value               Count   Frequency (%)
c                   42      24.4%
a                   28      16.3%
i                   28      16.3%
f                   14      8.1%
e                   14      8.1%
n                   14      8.1%
o                   8       4.7%
u                   8       4.7%
t                   8       4.7%
h                   8       4.7%

Uppercase Letter
Value               Count   Frequency (%)
P                   14      38.9%
O                   14      38.9%
S                   8       22.2%

Space Separator
Value               Count   Frequency (%)
(space)             22      100.0%

Most occurring scripts

Value               Count   Frequency (%)
Latin               208     90.4%
Common              22      9.6%

Most frequent character per script

Latin
Value               Count   Frequency (%)
c                   42      20.2%
a                   28      13.5%
i                   28      13.5%
P                   14      6.7%
f                   14      6.7%
O                   14      6.7%
e                   14      6.7%
n                   14      6.7%
S                   8       3.8%
o                   8       3.8%
Other values (3)    24      11.5%

Common
Value               Count   Frequency (%)
(space)             22      100.0%

Most occurring blocks

Value               Count   Frequency (%)
ASCII               230     100.0%

Most frequent character per block

ASCII
Value               Count   Frequency (%)
c                   42      18.3%
a                   28      12.2%
i                   28      12.2%
(space)             22      9.6%
P                   14      6.1%
f                   14      6.1%
O                   14      6.1%
e                   14      6.1%
n                   14      6.1%
S                   8       3.5%
Other values (4)    32      13.9%

islandGroup
Text

Missing 

Distinct13
Distinct (%)0.7%
Missing25019
Missing (%)93.4%
Memory size209.4 KiB

Length

Max length20
Median length16
Mean length15.90225989
Min length7

Characters and Unicode

Total characters28147
Distinct characters30
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2
Unique (%)0.1%

Sample

1st rowGreater Antilles
2nd rowGreater Antilles
3rd rowGreater Antilles
4th rowGreater Antilles
5th rowGreater Antilles
ValueCountFrequency (%)
antilles 1677
47.4%
greater 1658
46.9%
british 42
 
1.2%
isles 42
 
1.2%
islands 36
 
1.0%
lesser 19
 
0.5%
solomon 14
 
0.4%
archipelago 11
 
0.3%
japanese 10
 
0.3%
bahama 9
 
0.3%
Other values (7) 18
 
0.5%

Most occurring characters

ValueCountFrequency (%)
e 5107
18.1%
l 3458
12.3%
r 3392
12.1%
t 3377
12.0%
s 1927
 
6.8%
a 1795
 
6.4%
i 1794
 
6.4%
1766
 
6.3%
n 1754
 
6.2%
A 1688
 
6.0%
Other values (20) 2089
7.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 22845
81.2%
Uppercase Letter 3536
 
12.6%
Space Separator 1766
 
6.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 5107
22.4%
l 3458
15.1%
r 3392
14.8%
t 3377
14.8%
s 1927
 
8.4%
a 1795
 
7.9%
i 1794
 
7.9%
n 1754
 
7.7%
h 63
 
0.3%
o 55
 
0.2%
Other values (8) 123
 
0.5%
Uppercase Letter
ValueCountFrequency (%)
A 1688
47.7%
G 1658
46.9%
I 78
 
2.2%
B 51
 
1.4%
L 20
 
0.6%
S 16
 
0.5%
J 10
 
0.3%
H 8
 
0.2%
C 3
 
0.1%
V 2
 
0.1%
Space Separator
ValueCountFrequency (%)
1766
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 26381
93.7%
Common 1766
 
6.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 5107
19.4%
l 3458
13.1%
r 3392
12.9%
t 3377
12.8%
s 1927
 
7.3%
a 1795
 
6.8%
i 1794
 
6.8%
n 1754
 
6.6%
A 1688
 
6.4%
G 1658
 
6.3%
Other values (19) 431
 
1.6%
Common
ValueCountFrequency (%)
1766
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28147
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 5107
18.1%
l 3458
12.3%
r 3392
12.1%
t 3377
12.0%
s 1927
 
6.8%
a 1795
 
6.4%
i 1794
 
6.4%
1766
 
6.3%
n 1754
 
6.2%
A 1688
 
6.0%
Other values (20) 2089
7.4%

island
Text

Missing 

Distinct31
Distinct (%)6.6%
Missing26322
Missing (%)98.3%
Memory size209.4 KiB

Length

Max length14
Median length12
Mean length10.01498929
Min length4

Characters and Unicode

Total characters4677
Distinct characters40
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6
Unique (%)1.3%

Sample

1st rowBioko Island
2nd rowBioko Island
3rd rowBioko Island
4th rowBioko Island
5th rowHispaniola
ValueCountFrequency (%)
island 223
30.4%
bioko 222
30.2%
jamaica 50
 
6.8%
bermuda 40
 
5.4%
hispaniola 36
 
4.9%
saint 27
 
3.7%
croix 26
 
3.5%
new 11
 
1.5%
cuba 11
 
1.5%
eleuthera 8
 
1.1%
Other values (26) 80
 
10.9%

Most occurring characters

ValueCountFrequency (%)
a 620
13.3%
o 546
11.7%
i 442
 
9.5%
n 338
 
7.2%
d 284
 
6.1%
l 280
 
6.0%
s 275
 
5.9%
B 273
 
5.8%
267
 
5.7%
k 226
 
4.8%
Other values (30) 1126
24.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3676
78.6%
Uppercase Letter 734
 
15.7%
Space Separator 267
 
5.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 620
16.9%
o 546
14.9%
i 442
12.0%
n 338
9.2%
d 284
7.7%
l 280
7.6%
s 275
7.5%
k 226
 
6.1%
r 122
 
3.3%
e 110
 
3.0%
Other values (14) 433
11.8%
Uppercase Letter
ValueCountFrequency (%)
B 273
37.2%
I 224
30.5%
J 51
 
6.9%
H 40
 
5.4%
C 39
 
5.3%
S 36
 
4.9%
G 20
 
2.7%
N 15
 
2.0%
M 13
 
1.8%
E 8
 
1.1%
Other values (5) 15
 
2.0%
Space Separator
ValueCountFrequency (%)
267
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4410
94.3%
Common 267
 
5.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 620
14.1%
o 546
12.4%
i 442
10.0%
n 338
 
7.7%
d 284
 
6.4%
l 280
 
6.3%
s 275
 
6.2%
B 273
 
6.2%
k 226
 
5.1%
I 224
 
5.1%
Other values (29) 902
20.5%
Common
ValueCountFrequency (%)
267
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4677
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 620
13.3%
o 546
11.7%
i 442
 
9.5%
n 338
 
7.2%
d 284
 
6.1%
l 280
 
6.0%
s 275
 
5.9%
B 273
 
5.8%
267
 
5.7%
k 226
 
4.8%
Other values (30) 1126
24.1%

country
Text

Missing 

Distinct89
Distinct (%)0.4%
Missing1917
Missing (%)7.2%
Memory size209.4 KiB

Length

Max length32
Median length13
Mean length12.07132518
Min length4

Characters and Unicode

Total characters300238
Distinct characters48
Distinct categories3
Distinct scripts2
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13
Unique (%)0.1%

Sample

1st rowUnited States
2nd rowUnited States
3rd rowUnited States
4th rowUnited States
5th rowUnited States
ValueCountFrequency (%)
united 19822
43.0%
states 19737
42.8%
australia 1262
 
2.7%
mexico 698
 
1.5%
costa 641
 
1.4%
rica 641
 
1.4%
gabon 327
 
0.7%
canada 326
 
0.7%
puerto 289
 
0.6%
rico 289
 
0.6%
Other values (97) 2055
 
4.5%

Most occurring characters

ValueCountFrequency (%)
t 61858
20.6%
e 41239
13.7%
a 27695
9.2%
i 24194
 
8.1%
s 21874
 
7.3%
n 21762
 
7.2%
21215
 
7.1%
d 20415
 
6.8%
S 19875
 
6.6%
U 19829
 
6.6%
Other values (38) 20282
 
6.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 232975
77.6%
Uppercase Letter 46048
 
15.3%
Space Separator 21215
 
7.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
t 61858
26.6%
e 41239
17.7%
a 27695
11.9%
i 24194
 
10.4%
s 21874
 
9.4%
n 21762
 
9.3%
d 20415
 
8.8%
o 2890
 
1.2%
u 2242
 
1.0%
r 2167
 
0.9%
Other values (16) 6639
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
S 19875
43.2%
U 19829
43.1%
A 1304
 
2.8%
C 1076
 
2.3%
R 996
 
2.2%
M 782
 
1.7%
P 608
 
1.3%
G 601
 
1.3%
E 232
 
0.5%
I 138
 
0.3%
Other values (11) 607
 
1.3%
Space Separator
ValueCountFrequency (%)
21215
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 279023
92.9%
Common 21215
 
7.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
t 61858
22.2%
e 41239
14.8%
a 27695
9.9%
i 24194
 
8.7%
s 21874
 
7.8%
n 21762
 
7.8%
d 20415
 
7.3%
S 19875
 
7.1%
U 19829
 
7.1%
o 2890
 
1.0%
Other values (37) 17392
 
6.2%
Common
ValueCountFrequency (%)
21215
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 300238
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
t 61858
20.6%
e 41239
13.7%
a 27695
9.2%
i 24194
 
8.1%
s 21874
 
7.3%
n 21762
 
7.2%
21215
 
7.1%
d 20415
 
6.8%
S 19875
 
6.6%
U 19829
 
6.6%
Other values (38) 20282
 
6.8%

countryCode
Text

Missing 

Distinct89
Distinct (%)0.4%
Missing1917
Missing (%)7.2%
Memory size209.4 KiB

Length

Max length2
Median length2
Mean length2
Min length2
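
Every countryCode value has length 2, and the category tables for this variable list only uppercase letters, which is what two-letter country codes would look like. A hedged sketch of a shape check follows. It only tests the pattern, not membership in any official code list, and the malformed entries in `sample` are hypothetical values added for illustration.

```python
import re

# Shape check only: exactly two uppercase ASCII letters.
TWO_LETTER = re.compile(r"[A-Z]{2}")


def malformed_codes(codes):
    """Return non-missing codes that are not two uppercase letters."""
    return [c for c in codes if c is not None and not TWO_LETTER.fullmatch(c)]


# "US", "AU", "MX", "CR", "GA" appear in the report; "usa" and "U S" are
# hypothetical bad entries.
sample = ["US", "AU", "MX", "CR", "GA", None, "usa", "U S"]
bad = malformed_codes(sample)
print(bad)
```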

Characters and Unicode

Total characters49744
Distinct characters26
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13
Unique (%)0.1%

Sample

1st rowUS
2nd rowUS
3rd rowUS
4th rowUS
5th rowUS
ValueCountFrequency (%)
us 19737
79.4%
au 1262
 
5.1%
mx 698
 
2.8%
cr 641
 
2.6%
ga 327
 
1.3%
ca 326
 
1.3%
pr 289
 
1.2%
pa 227
 
0.9%
gq 222
 
0.9%
br 117
 
0.5%
Other values (79) 1026
 
4.1%

Most occurring characters

ValueCountFrequency (%)
U 21019
42.3%
S 19832
39.9%
A 2186
 
4.4%
R 1131
 
2.3%
C 1088
 
2.2%
M 847
 
1.7%
X 698
 
1.4%
G 695
 
1.4%
P 622
 
1.3%
B 233
 
0.5%
Other values (16) 1393
 
2.8%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 49744
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
U 21019
42.3%
S 19832
39.9%
A 2186
 
4.4%
R 1131
 
2.3%
C 1088
 
2.2%
M 847
 
1.7%
X 698
 
1.4%
G 695
 
1.4%
P 622
 
1.3%
B 233
 
0.5%
Other values (16) 1393
 
2.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 49744
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
U 21019
42.3%
S 19832
39.9%
A 2186
 
4.4%
R 1131
 
2.3%
C 1088
 
2.2%
M 847
 
1.7%
X 698
 
1.4%
G 695
 
1.4%
P 622
 
1.3%
B 233
 
0.5%
Other values (16) 1393
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 49744
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
U 21019
42.3%
S 19832
39.9%
A 2186
 
4.4%
R 1131
 
2.3%
C 1088
 
2.2%
M 847
 
1.7%
X 698
 
1.4%
G 695
 
1.4%
P 622
 
1.3%
B 233
 
0.5%
Other values (16) 1393
 
2.8%

stateProvince
Text

Missing 

Distinct173
Distinct (%)0.8%
Missing4766
Missing (%)17.8%
Memory size209.4 KiB

Length

Max length26
Median length25
Mean length8.648367616
Min length4

Characters and Unicode

Total characters190463
Distinct characters58
Distinct categories5
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique33
Unique (%)0.1%

Sample

1st rowFlorida
2nd rowTexas
3rd rowTexas
4th rowFlorida
5th rowFlorida
ValueCountFrequency (%)
new 4816
16.2%
york 3876
13.0%
georgia 2833
 
9.5%
florida 1656
 
5.6%
texas 1464
 
4.9%
california 1365
 
4.6%
australia 1163
 
3.9%
western 1152
 
3.9%
carolina 985
 
3.3%
arizona 975
 
3.3%
Other values (196) 9445
31.8%

Most occurring characters

ValueCountFrequency (%)
a 21707
 
11.4%
r 18281
 
9.6%
o 17177
 
9.0%
i 17067
 
9.0%
e 16287
 
8.6%
n 10119
 
5.3%
s 8214
 
4.3%
7707
 
4.0%
l 6920
 
3.6%
N 5742
 
3.0%
Other values (48) 61242
32.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 152486
80.1%
Uppercase Letter 29981
 
15.7%
Space Separator 7707
 
4.0%
Dash Punctuation 286
 
0.2%
Other Punctuation 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 21707
14.2%
r 18281
12.0%
o 17177
11.3%
i 17067
11.2%
e 16287
10.7%
n 10119
 
6.6%
s 8214
 
5.4%
l 6920
 
4.5%
t 5148
 
3.4%
w 4926
 
3.2%
Other values (21) 26640
17.5%
Uppercase Letter
ValueCountFrequency (%)
N 5742
19.2%
Y 3880
12.9%
G 3031
10.1%
C 2786
9.3%
A 2459
8.2%
W 1825
 
6.1%
F 1662
 
5.5%
T 1657
 
5.5%
M 1263
 
4.2%
O 997
 
3.3%
Other values (14) 4679
15.6%
Space Separator
ValueCountFrequency (%)
7707
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 286
100.0%
Other Punctuation
ValueCountFrequency (%)
' 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 182467
95.8%
Common 7996
 
4.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 21707
 
11.9%
r 18281
 
10.0%
o 17177
 
9.4%
i 17067
 
9.4%
e 16287
 
8.9%
n 10119
 
5.5%
s 8214
 
4.5%
l 6920
 
3.8%
N 5742
 
3.1%
t 5148
 
2.8%
Other values (45) 55805
30.6%
Common
ValueCountFrequency (%)
7707
96.4%
- 286
 
3.6%
' 3
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 189831
99.7%
None 632
 
0.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 21707
 
11.4%
r 18281
 
9.6%
o 17177
 
9.0%
i 17067
 
9.0%
e 16287
 
8.6%
n 10119
 
5.3%
s 8214
 
4.3%
7707
 
4.1%
l 6920
 
3.6%
N 5742
 
3.0%
Other values (42) 60610
31.9%
None
ValueCountFrequency (%)
é 273
43.2%
á 175
27.7%
ó 161
25.5%
í 15
 
2.4%
ã 4
 
0.6%
ô 4
 
0.6%
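
The six characters grouped under the block label "None" above are all accented Latin letters outside the ASCII range. The sketch below uses Python's standard `unicodedata` module to show their code points and names; it is an illustration only, and the helper name `describe` is hypothetical. All six fall in the range U+0080 to U+00FF.

```python
import unicodedata

# The six non-ASCII characters listed in the table: é á ó í ã ô
chars = "\u00e9\u00e1\u00f3\u00ed\u00e3\u00f4"


def describe(ch):
    """Return (code point label, Unicode name, is_ascii) for one character."""
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), ord(ch) < 128)


rows = [describe(ch) for ch in chars]
for code_point, name, is_ascii in rows:
    print(code_point, name, "ASCII" if is_ascii else "non-ASCII")
```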

county
Text

Missing 

Distinct800
Distinct (%)4.6%
Missing9234
Missing (%)34.5%
Memory size209.4 KiB

Length

Max length38
Median length22
Mean length7.66174879
Min length3

Characters and Unicode

Total characters134502
Distinct characters62
Distinct categories6
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181
Unique (%)1.0%

Sample

1st rowHillsborough
2nd rowHidalgo
3rd rowHidalgo
4th rowLee
5th rowLee
ValueCountFrequency (%)
tompkins 2004
 
10.4%
charlton 1751
 
9.1%
chattahoochee 537
 
2.8%
san 380
 
2.0%
cochise 352
 
1.8%
chemung 308
 
1.6%
lee 298
 
1.5%
rio 290
 
1.5%
grande 289
 
1.5%
diego 239
 
1.2%
Other values (831) 12861
66.6%

Most occurring characters

ValueCountFrequency (%)
a 12571
 
9.3%
o 12080
 
9.0%
e 11647
 
8.7%
n 11174
 
8.3%
r 8029
 
6.0%
i 7819
 
5.8%
s 6500
 
4.8%
t 6305
 
4.7%
h 6163
 
4.6%
l 6069
 
4.5%
Other values (52) 46145
34.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 112832
83.9%
Uppercase Letter 19638
 
14.6%
Space Separator 1754
 
1.3%
Dash Punctuation 170
 
0.1%
Other Punctuation 106
 
0.1%
Decimal Number 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 12571
11.1%
o 12080
10.7%
e 11647
10.3%
n 11174
9.9%
r 8029
 
7.1%
i 7819
 
6.9%
s 6500
 
5.8%
t 6305
 
5.6%
h 6163
 
5.5%
l 6069
 
5.4%
Other values (20) 24475
21.7%
Uppercase Letter
ValueCountFrequency (%)
C 4310
21.9%
T 2365
12.0%
S 1373
 
7.0%
M 1242
 
6.3%
P 1014
 
5.2%
L 922
 
4.7%
B 903
 
4.6%
A 805
 
4.1%
D 803
 
4.1%
H 788
 
4.0%
Other values (16) 5113
26.0%
Other Punctuation
ValueCountFrequency (%)
' 76
71.7%
. 30
 
28.3%
Decimal Number
ValueCountFrequency (%)
1 1
50.0%
0 1
50.0%
Space Separator
ValueCountFrequency (%)
1754
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 170
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 132470
98.5%
Common 2032
 
1.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 12571
 
9.5%
o 12080
 
9.1%
e 11647
 
8.8%
n 11174
 
8.4%
r 8029
 
6.1%
i 7819
 
5.9%
s 6500
 
4.9%
t 6305
 
4.8%
h 6163
 
4.7%
l 6069
 
4.6%
Other values (46) 44113
33.3%
Common
ValueCountFrequency (%)
1754
86.3%
- 170
 
8.4%
' 76
 
3.7%
. 30
 
1.5%
1 1
 
< 0.1%
0 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 134418
99.9%
None 84
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 12571
 
9.4%
o 12080
 
9.0%
e 11647
 
8.7%
n 11174
 
8.3%
r 8029
 
6.0%
i 7819
 
5.8%
s 6500
 
4.8%
t 6305
 
4.7%
h 6163
 
4.6%
l 6069
 
4.5%
Other values (47) 46061
34.3%
None
ValueCountFrequency (%)
ñ 58
69.0%
é 10
 
11.9%
Î 10
 
11.9%
ô 5
 
6.0%
è 1
 
1.2%

municipality
Text

Missing 

Distinct2
Distinct (%)50.0%
Missing26785
Missing (%)> 99.9%
Memory size209.4 KiB

Length

Max length10
Median length10
Mean length9.25
Min length7
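
The length figures for municipality can be checked by hand, because only four rows are non-missing and all four appear in the Sample list for this variable. A minimal sketch using the standard `statistics` module follows; `length_stats` is a hypothetical helper, not part of the profiling tool.

```python
import statistics


def length_stats(values):
    """Return max, median, mean and min string length of non-missing values."""
    lengths = [len(v) for v in values if v is not None]
    return {
        "max": max(lengths),
        "median": statistics.median(lengths),
        "mean": statistics.mean(lengths),
        "min": min(lengths),
    }


# The four non-missing municipality values shown in the report's sample.
municipality = ["Sherbrooke", "Sherbrooke", "Sherbrooke", "Madison"]
stats = length_stats(municipality)
print(stats)
```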

Characters and Unicode

Total characters37
Distinct characters13
Distinct categories2
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1
Unique (%)25.0%
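
Unique here counts values that occur exactly once, which differs from Distinct (the number of different values). The sketch below makes the distinction concrete for the four non-missing municipality rows; it is an illustrative reading of the two figures, and `unique_count` is a hypothetical helper.

```python
from collections import Counter


def unique_count(values):
    """Return (number of values occurring exactly once, percent of non-missing)."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n_unique = sum(1 for n in counts.values() if n == 1)
    pct = 100 * n_unique / len(present) if present else 0.0
    return n_unique, pct


municipality = ["Sherbrooke", "Sherbrooke", "Sherbrooke", "Madison"]
n_unique, pct = unique_count(municipality)
n_distinct = len(set(municipality))
print(n_distinct, n_unique, f"{pct:.1f}%")
```

Only "Madison" occurs once, so Unique is 1 (25.0%) while Distinct is 2.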

Sample

1st rowSherbrooke
2nd rowSherbrooke
3rd rowSherbrooke
4th rowMadison
Value               Count   Frequency (%)
sherbrooke          3       75.0%
madison             1       25.0%

Most occurring characters

Value               Count   Frequency (%)
o                   7       18.9%
e                   6       16.2%
r                   6       16.2%
S                   3       8.1%
h                   3       8.1%
b                   3       8.1%
k                   3       8.1%
M                   1       2.7%
a                   1       2.7%
d                   1       2.7%
Other values (3)    3       8.1%

Most occurring categories

Value               Count   Frequency (%)
Lowercase Letter    33      89.2%
Uppercase Letter    4       10.8%
10.8%

Most frequent character per category

Lowercase Letter
Value               Count   Frequency (%)
o                   7       21.2%
e                   6       18.2%
r                   6       18.2%
h                   3       9.1%
b                   3       9.1%
k                   3       9.1%
a                   1       3.0%
d                   1       3.0%
i                   1       3.0%
s                   1       3.0%

Uppercase Letter
Value               Count   Frequency (%)
S                   3       75.0%
M                   1       25.0%

Most occurring scripts

Value               Count   Frequency (%)
Latin               37      100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 7
18.9%
e 6
16.2%
r 6
16.2%
S 3
8.1%
h 3
8.1%
b 3
8.1%
k 3
8.1%
M 1
 
2.7%
a 1
 
2.7%
d 1
 
2.7%
Other values (3) 3
8.1%

Most occurring blocks

Value               Count   Frequency (%)
ASCII               37      100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 7
18.9%
e 6
16.2%
r 6
16.2%
S 3
8.1%
h 3
8.1%
b 3
8.1%
k 3
8.1%
M 1
 
2.7%
a 1
 
2.7%
d 1
 
2.7%
Other values (3) 3
8.1%
Distinct8080
Distinct (%)30.2%
Missing1
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length127
Median length99
Mean length26.19811109
Min length3

Characters and Unicode

Total characters701795
Distinct characters86
Distinct categories12
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5365
Unique (%)20.0%

Sample

1st rowTampa
2nd rowEdinburg, 8 km NW
3rd rowEdinburg, 8 km NW
4th rowFort Myers, Fort Myers Country Club
5th rowFort Myers, Fort Myers Country Club
ValueCountFrequency (%)
mi 4808
 
4.0%
of 4084
 
3.4%
unspecified 2291
 
1.9%
swamp 1826
 
1.5%
island 1826
 
1.5%
okefenokee 1567
 
1.3%
site 1364
 
1.1%
km 1305
 
1.1%
1 1255
 
1.1%
billy's 1233
 
1.0%
Other values (6607) 97868
81.9%

Most occurring characters

ValueCountFrequency (%)
92830
 
13.2%
e 54528
 
7.8%
a 48726
 
6.9%
i 40557
 
5.8%
n 39807
 
5.7%
o 39679
 
5.7%
r 30089
 
4.3%
l 29291
 
4.2%
t 28994
 
4.1%
s 24303
 
3.5%
Other values (76) 272991
38.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 464963
66.3%
Space Separator 92830
 
13.2%
Uppercase Letter 88874
 
12.7%
Other Punctuation 30479
 
4.3%
Decimal Number 20870
 
3.0%
Dash Punctuation 928
 
0.1%
Open Punctuation 901
 
0.1%
Close Punctuation 897
 
0.1%
Math Symbol 692
 
0.1%
Connector Punctuation 222
 
< 0.1%
Other values (2) 139
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 54528
11.7%
a 48726
10.5%
i 40557
 
8.7%
n 39807
 
8.6%
o 39679
 
8.5%
r 30089
 
6.5%
l 29291
 
6.3%
t 28994
 
6.2%
s 24303
 
5.2%
d 17515
 
3.8%
Other values (16) 111474
24.0%
Uppercase Letter
ValueCountFrequency (%)
S 12328
13.9%
C 7039
 
7.9%
R 6813
 
7.7%
B 5980
 
6.7%
P 5500
 
6.2%
M 5477
 
6.2%
L 5313
 
6.0%
N 5035
 
5.7%
E 4829
 
5.4%
W 4141
 
4.7%
Other values (16) 26419
29.7%
Other Punctuation
ValueCountFrequency (%)
, 13540
44.4%
. 12657
41.5%
' 2267
 
7.4%
: 621
 
2.0%
/ 382
 
1.3%
; 359
 
1.2%
& 293
 
1.0%
# 147
 
0.5%
? 144
 
0.5%
" 34
 
0.1%
Other values (3) 35
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 4746
22.7%
2 2907
13.9%
0 2579
12.4%
5 2376
11.4%
3 2187
10.5%
9 1545
 
7.4%
4 1342
 
6.4%
7 1090
 
5.2%
6 1089
 
5.2%
8 1009
 
4.8%
Open Punctuation
ValueCountFrequency (%)
( 876
97.2%
[ 25
 
2.8%
Close Punctuation
ValueCountFrequency (%)
) 872
97.2%
] 25
 
2.8%
Math Symbol
ValueCountFrequency (%)
~ 687
99.3%
= 5
 
0.7%
Space Separator
ValueCountFrequency (%)
92830
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 928
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 222
100.0%
Other Symbol
ValueCountFrequency (%)
134
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 553837
78.9%
Common 147958
 
21.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 54528
 
9.8%
a 48726
 
8.8%
i 40557
 
7.3%
n 39807
 
7.2%
o 39679
 
7.2%
r 30089
 
5.4%
l 29291
 
5.3%
t 28994
 
5.2%
s 24303
 
4.4%
d 17515
 
3.2%
Other values (42) 200348
36.2%
Common
ValueCountFrequency (%)
92830
62.7%
, 13540
 
9.2%
. 12657
 
8.6%
1 4746
 
3.2%
2 2907
 
2.0%
0 2579
 
1.7%
5 2376
 
1.6%
' 2267
 
1.5%
3 2187
 
1.5%
9 1545
 
1.0%
Other values (24) 10324
 
7.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 701661
> 99.9%
Specials 134
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
92830
 
13.2%
e 54528
 
7.8%
a 48726
 
6.9%
i 40557
 
5.8%
n 39807
 
5.7%
o 39679
 
5.7%
r 30089
 
4.3%
l 29291
 
4.2%
t 28994
 
4.1%
s 24303
 
3.5%
Other values (75) 272857
38.9%
Specials
ValueCountFrequency (%)
134
100.0%
Distinct8562
Distinct (%)32.0%
Missing1
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length189
Median length150
Mean length79.9117142
Min length13

Characters and Unicode

Total characters2140675
Distinct characters87
Distinct categories12
Distinct scripts2
Distinct blocks2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5720
Unique (%)21.4%

Sample

1st rowNorth America | United States | Florida | Hillsborough County
2nd rowNorth America | United States | Texas | Hidalgo County
3rd rowNorth America | United States | Texas | Hidalgo County
4th rowNorth America | United States | Florida | Lee County
5th rowNorth America | United States | Florida | Lee County
ValueCountFrequency (%)
95902
24.7%
america 20716
 
5.3%
north 20473
 
5.3%
united 20168
 
5.2%
states 20082
 
5.2%
county 16915
 
4.4%
new 5014
 
1.3%
mi 4836
 
1.2%
of 4075
 
1.0%
york 3889
 
1.0%
Other values (7200) 176397
45.4%
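
The sample rows for this variable pack several geographic levels into one string separated by " | ". A small sketch for splitting such a value into its levels follows. The helper name `split_levels` is hypothetical, and the assumption that the separator never occurs inside a level name is not confirmed by the report.

```python
def split_levels(value, sep="|"):
    """Split a pipe-delimited hierarchy string into trimmed level names."""
    return [part.strip() for part in value.split(sep)]


# First sample row shown in the report for this variable.
sample = "North America | United States | Florida | Hillsborough County"
levels = split_levels(sample)
print(levels)
```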

Most occurring characters

ValueCountFrequency (%)
361863
16.9%
e 163257
 
7.6%
t 148412
 
6.9%
a 141454
 
6.6%
i 120775
 
5.6%
o 117930
 
5.5%
r 114589
 
5.4%
n 106516
 
5.0%
| 95425
 
4.5%
s 68302
 
3.2%
Other values (77) 702152
32.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1372484
64.1%
Space Separator 361863
 
16.9%
Uppercase Letter 256160
 
12.0%
Math Symbol 95967
 
4.5%
Other Punctuation 30020
 
1.4%
Decimal Number 20625
 
1.0%
Dash Punctuation 1430
 
0.1%
Open Punctuation 896
 
< 0.1%
Close Punctuation 892
 
< 0.1%
Connector Punctuation 222
 
< 0.1%
Other values (2) 116
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 163257
11.9%
t 148412
10.8%
a 141454
10.3%
i 120775
8.8%
o 117930
8.6%
r 114589
 
8.3%
n 106516
 
7.8%
s 68302
 
5.0%
l 50178
 
3.7%
d 49075
 
3.6%
Other values (16) 291996
21.3%
Uppercase Letter
ValueCountFrequency (%)
S 34581
13.5%
C 32789
12.8%
A 31464
12.3%
N 30702
12.0%
U 20997
 
8.2%
R 9869
 
3.9%
P 9681
 
3.8%
M 8778
 
3.4%
G 8463
 
3.3%
W 8278
 
3.2%
Other values (16) 60558
23.6%
Other Punctuation
ValueCountFrequency (%)
, 13012
43.3%
. 12627
42.1%
' 2358
 
7.9%
: 621
 
2.1%
/ 377
 
1.3%
; 357
 
1.2%
& 293
 
1.0%
# 147
 
0.5%
? 147
 
0.5%
" 46
 
0.2%
Other values (3) 35
 
0.1%
Decimal Number
ValueCountFrequency (%)
1 4716
22.9%
2 2899
14.1%
0 2574
12.5%
5 2354
11.4%
3 2190
10.6%
9 1537
 
7.5%
4 1323
 
6.4%
6 1086
 
5.3%
8 996
 
4.8%
7 950
 
4.6%
Math Symbol
ValueCountFrequency (%)
| 95425
99.4%
~ 537
 
0.6%
= 5
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 871
97.2%
[ 25
 
2.8%
Close Punctuation
ValueCountFrequency (%)
) 867
97.2%
] 25
 
2.8%
Space Separator
ValueCountFrequency (%)
361863
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1430
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 222
100.0%
Other Symbol
ValueCountFrequency (%)
111
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1628644
76.1%
Common 512031
 
23.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 163257
 
10.0%
t 148412
 
9.1%
a 141454
 
8.7%
i 120775
 
7.4%
o 117930
 
7.2%
r 114589
 
7.0%
n 106516
 
6.5%
s 68302
 
4.2%
l 50178
 
3.1%
d 49075
 
3.0%
Other values (42) 548156
33.7%
Common
ValueCountFrequency (%)
361863
70.7%
| 95425
 
18.6%
, 13012
 
2.5%
. 12627
 
2.5%
1 4716
 
0.9%
2 2899
 
0.6%
0 2574
 
0.5%
' 2358
 
0.5%
5 2354
 
0.5%
3 2190
 
0.4%
Other values (25) 12013
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2140564
> 99.9%
Specials 111
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
361863
16.9%
e 163257
 
7.6%
t 148412
 
6.9%
a 141454
 
6.6%
i 120775
 
5.6%
o 117930
 
5.5%
r 114589
 
5.4%
n 106516
 
5.0%
| 95425
 
4.5%
s 68302
 
3.2%
Other values (76) 702041
32.8%
Specials
ValueCountFrequency (%)
111
100.0%
Distinct175
Distinct (%)9.2%
Missing24884
Missing (%)92.9%
Memory size209.4 KiB

Length

Max length5
Median length4
Mean length3.145931759
Min length1

Characters and Unicode

Total characters5993
Distinct characters10
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53
Unique (%)2.8%

Sample

1st row649
2nd row649
3rd row649
4th row649
5th row649
ValueCountFrequency (%)
50 252
 
13.2%
350 225
 
11.8%
750 187
 
9.8%
649 84
 
4.4%
530 69
 
3.6%
2000 54
 
2.8%
441 52
 
2.7%
1051 47
 
2.5%
1000 36
 
1.9%
460 35
 
1.8%
Other values (165) 864
45.4%

Most occurring characters

ValueCountFrequency (%)
0 1958
32.7%
5 1070
17.9%
1 662
 
11.0%
4 495
 
8.3%
3 474
 
7.9%
7 405
 
6.8%
6 282
 
4.7%
2 248
 
4.1%
9 223
 
3.7%
8 176
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5993
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1958
32.7%
5 1070
17.9%
1 662
 
11.0%
4 495
 
8.3%
3 474
 
7.9%
7 405
 
6.8%
6 282
 
4.7%
2 248
 
4.1%
9 223
 
3.7%
8 176
 
2.9%

Most occurring scripts

ValueCountFrequency (%)
Common 5993
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1958
32.7%
5 1070
17.9%
1 662
 
11.0%
4 495
 
8.3%
3 474
 
7.9%
7 405
 
6.8%
6 282
 
4.7%
2 248
 
4.1%
9 223
 
3.7%
8 176
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5993
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1958
32.7%
5 1070
17.9%
1 662
 
11.0%
4 495
 
8.3%
3 474
 
7.9%
7 405
 
6.8%
6 282
 
4.7%
2 248
 
4.1%
9 223
 
3.7%
8 176
 
2.9%
Distinct177
Distinct (%)9.3%
Missing24884
Missing (%)92.9%
Memory size209.4 KiB

Length

Max length5
Median length4
Mean length3.142782152
Min length1

Characters and Unicode

Total characters5987
Distinct characters10
Distinct categories1
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique54
Unique (%)2.8%

Sample

1st row649
2nd row649
3rd row649
4th row649
5th row649
ValueCountFrequency (%)
50 252
 
13.2%
350 225
 
11.8%
750 187
 
9.8%
649 84
 
4.4%
530 69
 
3.6%
2000 54
 
2.8%
441 52
 
2.7%
1051 47
 
2.5%
210 39
 
2.0%
460 35
 
1.8%
Other values (167) 861
45.2%

Most occurring characters

ValueCountFrequency (%)
0 1974
33.0%
5 1063
17.8%
1 620
 
10.4%
4 452
 
7.5%
3 438
 
7.3%
7 404
 
6.7%
2 337
 
5.6%
6 299
 
5.0%
9 216
 
3.6%
8 184
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5987
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1974
33.0%
5 1063
17.8%
1 620
 
10.4%
4 452
 
7.5%
3 438
 
7.3%
7 404
 
6.7%
2 337
 
5.6%
6 299
 
5.0%
9 216
 
3.6%
8 184
 
3.1%

Most occurring scripts

ValueCountFrequency (%)
Common 5987
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1974
33.0%
5 1063
17.8%
1 620
 
10.4%
4 452
 
7.5%
3 438
 
7.3%
7 404
 
6.7%
2 337
 
5.6%
6 299
 
5.0%
9 216
 
3.6%
8 184
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5987
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1974
33.0%
5 1063
17.8%
1 620
 
10.4%
4 452
 
7.5%
3 438
 
7.3%
7 404
 
6.7%
2 337
 
5.6%
6 299
 
5.0%
9 216
 
3.6%
8 184
 
3.1%

decimalLatitude
Text

Missing 

Distinct3370
Distinct (%)20.4%
Missing10270
Missing (%)38.3%
Memory size209.4 KiB

Length

Max length11
Median length10
Mean length8.579090744
Min length1

Characters and Unicode

Total characters141718
Distinct characters12
Distinct categories3
Distinct scripts1
Distinct blocks1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1748
Unique (%)10.6%

Sample

1st row28.066114
2nd row26.34243
3rd row26.34243
4th row26.612078
5th row26.612078
ValueCountFrequency (%)
30.805235 903
 
5.5%
32.3629 516
 
3.1%
42.4381028 265
 
1.6%
30.7996 257
 
1.6%
18.3641503 251
 
1.5%
18.321112 193
 
1.2%
42.40257 161
 
1.0%
42.45955 158
 
1.0%
31.759664 150
 
0.9%
18.3691667 147
 
0.9%
Other values (3357) 13518
81.8%
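
decimalLatitude is profiled here as Text, so its values are strings rather than numbers. The sketch below shows one way to coerce such strings to floats and reject anything outside the valid latitude range. It is an illustrative approach, not something the report performs. The first three sample strings come from the report's sample rows, while "n/a" and "123.4" are hypothetical bad values.

```python
def parse_latitude(text):
    """Return a float latitude, or None if missing, non-numeric or out of range."""
    if text is None:
        return None
    try:
        value = float(text)
    except ValueError:
        return None
    # Latitudes must lie between -90 and 90 degrees; NaN fails this test too.
    return value if -90.0 <= value <= 90.0 else None


samples = ["28.066114", "26.34243", "26.612078", None, "n/a", "123.4"]
parsed = [parse_latitude(s) for s in samples]
print(parsed)
```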

Most occurring characters

ValueCountFrequency (%)
3 18694
13.2%
2 17117
12.1%
. 16484
11.6%
4 13682
9.7%
0 12347
8.7%
1 11854
8.4%
9 10595
7.5%
6 10424
7.4%
5 10230
7.2%
8 9948
7.0%
Other values (2) 10343
7.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 124075
87.6%
Other Punctuation 16484
 
11.6%
Dash Punctuation 1159
 
0.8%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 18694
15.1%
2 17117
13.8%
4 13682
11.0%
0 12347
10.0%
1 11854
9.6%
9 10595
8.5%
6 10424
8.4%
5 10230
8.2%
8 9948
8.0%
7 9184
7.4%
Other Punctuation
ValueCountFrequency (%)
. 16484
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1159
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 141718
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 18694
13.2%
2 17117
12.1%
. 16484
11.6%
4 13682
9.7%
0 12347
8.7%
1 11854
8.4%
9 10595
7.5%
6 10424
7.4%
5 10230
7.2%
8 9948
7.0%
Other values (2) 10343
7.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 141718
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 18694
13.2%
2 17117
12.1%
. 16484
11.6%
4 13682
9.7%
0 12347
8.7%
1 11854
8.4%
9 10595
7.5%
6 10424
7.4%
5 10230
7.2%
8 9948
7.0%
Other values (2) 10343
7.3%

decimalLongitude
Text

Missing 

Distinct3395
Distinct (%)20.6%
Missing10270
Missing (%)38.3%
Memory size209.4 KiB

Length

Max length12
Median length10
Mean length9.564804165
Min length2

Characters and Unicode

Total characters158001
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1792 ?
Unique (%)10.8%

Sample

1st row-82.392643
2nd row-98.23054
3rd row-98.23054
4th row-81.874428
5th row-81.874428
ValueCountFrequency (%)
82.340294 903
 
5.5%
84.9651 516
 
3.1%
76.5094764 265
 
1.6%
82.3459 257
 
1.6%
65.8196417 251
 
1.5%
65.819557 193
 
1.2%
76.33041 161
 
1.0%
76.4595 158
 
1.0%
93.085265 150
 
0.9%
65.7594444 147
 
0.9%
Other values (3385) 13518
81.8%

Most occurring characters

ValueCountFrequency (%)
. 16506
10.4%
- 14942
9.5%
7 14779
9.4%
1 14421
9.1%
8 14401
9.1%
6 14135
8.9%
9 13684
8.7%
2 12214
7.7%
4 11817
7.5%
3 10893
6.9%
Other values (2) 20209
12.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 126553
80.1%
Other Punctuation 16506
 
10.4%
Dash Punctuation 14942
 
9.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
7 14779
11.7%
1 14421
11.4%
8 14401
11.4%
6 14135
11.2%
9 13684
10.8%
2 12214
9.7%
4 11817
9.3%
3 10893
8.6%
5 10750
8.5%
0 9459
7.5%
Other Punctuation
ValueCountFrequency (%)
. 16506
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14942
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 158001
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
. 16506
10.4%
- 14942
9.5%
7 14779
9.4%
1 14421
9.1%
8 14401
9.1%
6 14135
8.9%
9 13684
8.7%
2 12214
7.7%
4 11817
7.5%
3 10893
6.9%
Other values (2) 20209
12.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 158001
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 16506
10.4%
- 14942
9.5%
7 14779
9.4%
1 14421
9.1%
8 14401
9.1%
6 14135
8.9%
9 13684
8.7%
2 12214
7.7%
4 11817
7.5%
3 10893
6.9%
Other values (2) 20209
12.8%

geodeticDatum
Text

Missing 

Distinct10
Distinct (%)0.1%
Missing10270
Missing (%)38.3%
Memory size209.4 KiB

Length

Max length32
Median length5
Mean length9.298383679
Min length5

Characters and Unicode

Total characters153600
Distinct characters38
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rownot recorded (forced WGS84)
2nd rownot recorded (forced WGS84)
3rd rownot recorded (forced WGS84)
4th rowWGS84
5th rowWGS84
ValueCountFrequency (%)
wgs84 13387
51.0%
recorded 3013
 
11.5%
forced 3013
 
11.5%
not 3013
 
11.5%
nad27 1914
 
7.3%
nad83 942
 
3.6%
of 169
 
0.6%
1994 169
 
0.6%
australia 169
 
0.6%
datum 169
 
0.6%
Other values (9) 294
 
1.1%

Most occurring characters

ValueCountFrequency (%)
8 14329
 
9.3%
G 13650
 
8.9%
4 13556
 
8.8%
S 13481
 
8.8%
W 13475
 
8.8%
9733
 
6.3%
o 9409
 
6.1%
e 9383
 
6.1%
r 9220
 
6.0%
d 9208
 
6.0%
Other values (28) 38156
24.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 54924
35.8%
Uppercase Letter 49543
32.3%
Decimal Number 33374
21.7%
Space Separator 9733
 
6.3%
Open Punctuation 3013
 
2.0%
Close Punctuation 3013
 
2.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 9409
17.1%
e 9383
17.1%
r 9220
16.8%
d 9208
16.8%
c 6201
11.3%
t 3526
 
6.4%
f 3182
 
5.8%
n 3025
 
5.5%
a 519
 
0.9%
i 356
 
0.6%
Other values (8) 895
 
1.6%
Decimal Number
ValueCountFrequency (%)
8 14329
42.9%
4 13556
40.6%
7 1987
 
6.0%
2 1987
 
6.0%
3 942
 
2.8%
9 344
 
1.0%
1 175
 
0.5%
6 48
 
0.1%
5 6
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
G 13650
27.6%
S 13481
27.2%
W 13475
27.2%
A 3037
 
6.1%
D 3031
 
6.1%
N 2856
 
5.8%
T 7
 
< 0.1%
P 6
 
< 0.1%
Space Separator
ValueCountFrequency (%)
9733
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3013
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3013
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 104467
68.0%
Common 49133
32.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
G 13650
13.1%
S 13481
12.9%
W 13475
12.9%
o 9409
9.0%
e 9383
9.0%
r 9220
8.8%
d 9208
8.8%
c 6201
5.9%
t 3526
 
3.4%
f 3182
 
3.0%
Other values (16) 13732
13.1%
Common
ValueCountFrequency (%)
8 14329
29.2%
4 13556
27.6%
9733
19.8%
( 3013
 
6.1%
) 3013
 
6.1%
7 1987
 
4.0%
2 1987
 
4.0%
3 942
 
1.9%
9 344
 
0.7%
1 175
 
0.4%
Other values (2) 54
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 153600
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
8 14329
 
9.3%
G 13650
 
8.9%
4 13556
 
8.8%
S 13481
 
8.8%
W 13475
 
8.8%
9733
 
6.3%
o 9409
 
6.1%
e 9383
 
6.1%
r 9220
 
6.0%
d 9208
 
6.0%
Other values (28) 38156
24.8%
Distinct1696
Distinct (%)14.6%
Missing15142
Missing (%)56.5%
Memory size209.4 KiB

Length

Max length7
Median length4
Mean length3.903752039
Min length1

Characters and Unicode

Total characters45467
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique703 ?
Unique (%)6.0%

Sample

1st row118
2nd row118
3rd row4266
4th row4266
5th row118
ValueCountFrequency (%)
2050 903
 
7.8%
35406 501
 
4.3%
542 242
 
2.1%
2414 241
 
2.1%
625 195
 
1.7%
6973 195
 
1.7%
100 157
 
1.3%
1753 156
 
1.3%
1849 150
 
1.3%
47701 150
 
1.3%
Other values (1686) 8757
75.2%

Most occurring characters

ValueCountFrequency (%)
0 6375
14.0%
1 6070
13.4%
2 5898
13.0%
5 5269
11.6%
4 4711
10.4%
3 4384
9.6%
6 4158
9.1%
7 3317
7.3%
9 2717
6.0%
8 2568
5.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 45467
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6375
14.0%
1 6070
13.4%
2 5898
13.0%
5 5269
11.6%
4 4711
10.4%
3 4384
9.6%
6 4158
9.1%
7 3317
7.3%
9 2717
6.0%
8 2568
5.6%

Most occurring scripts

ValueCountFrequency (%)
Common 45467
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6375
14.0%
1 6070
13.4%
2 5898
13.0%
5 5269
11.6%
4 4711
10.4%
3 4384
9.6%
6 4158
9.1%
7 3317
7.3%
9 2717
6.0%
8 2568
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45467
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6375
14.0%
1 6070
13.4%
2 5898
13.0%
5 5269
11.6%
4 4711
10.4%
3 4384
9.6%
6 4158
9.1%
7 3317
7.3%
9 2717
6.0%
8 2568
5.6%

georeferencedBy
Text

Missing 

Distinct2
Distinct (%)< 0.1%
Missing10456
Missing (%)39.0%
Memory size209.4 KiB

Length

Max length17
Median length16
Mean length16.01793914
Min length16

Characters and Unicode

Total characters261621
Distinct characters19
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCharles M. Dardia
2nd rowCharles M. Dardia
3rd rowCharles M. Dardia
4th rowCharles M. Dardia
5th rowCharles M. Dardia
ValueCountFrequency (%)
agent 16040
32.7%
man 16040
32.7%
secret 16040
32.7%
charles 293
 
0.6%
m 293
 
0.6%
dardia 293
 
0.6%

Most occurring characters

ValueCountFrequency (%)
e 48413
18.5%
32666
12.5%
n 32080
12.3%
t 32080
12.3%
a 16919
 
6.5%
r 16626
 
6.4%
M 16333
 
6.2%
A 16040
 
6.1%
g 16040
 
6.1%
c 16040
 
6.1%
Other values (9) 18384
 
7.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 179663
68.7%
Uppercase Letter 48999
 
18.7%
Space Separator 32666
 
12.5%
Other Punctuation 293
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 48413
26.9%
n 32080
17.9%
t 32080
17.9%
a 16919
 
9.4%
r 16626
 
9.3%
g 16040
 
8.9%
c 16040
 
8.9%
h 293
 
0.2%
l 293
 
0.2%
s 293
 
0.2%
Other values (2) 586
 
0.3%
Uppercase Letter
ValueCountFrequency (%)
M 16333
33.3%
A 16040
32.7%
S 16040
32.7%
C 293
 
0.6%
D 293
 
0.6%
Space Separator
ValueCountFrequency (%)
32666
100.0%
Other Punctuation
ValueCountFrequency (%)
. 293
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 228662
87.4%
Common 32959
 
12.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 48413
21.2%
n 32080
14.0%
t 32080
14.0%
a 16919
 
7.4%
r 16626
 
7.3%
M 16333
 
7.1%
A 16040
 
7.0%
g 16040
 
7.0%
c 16040
 
7.0%
S 16040
 
7.0%
Other values (7) 2051
 
0.9%
Common
ValueCountFrequency (%)
32666
99.1%
. 293
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 261621
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 48413
18.5%
32666
12.5%
n 32080
12.3%
t 32080
12.3%
a 16919
 
6.5%
r 16626
 
6.4%
M 16333
 
6.2%
A 16040
 
6.1%
g 16040
 
6.1%
c 16040
 
6.1%
Other values (9) 18384
 
7.0%

georeferencedDate
Text

Missing 

Distinct716
Distinct (%)5.6%
Missing13907
Missing (%)51.9%
Memory size209.4 KiB

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters128820
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)1.2%

Sample

1st row2017-02-22
2nd row2017-02-22
3rd row2016-12-12
4th row2016-12-12
5th row2017-02-22
ValueCountFrequency (%)
2010-12-01 905
 
7.0%
2008-03-13 610
 
4.7%
2008-04-01 520
 
4.0%
2011-11-30 492
 
3.8%
2011-04-11 380
 
2.9%
2007-02-09 259
 
2.0%
2006-06-05 220
 
1.7%
2007-01-04 204
 
1.6%
2008-02-20 203
 
1.6%
2007-11-13 193
 
1.5%
Other values (706) 8896
69.1%

Most occurring characters

ValueCountFrequency (%)
0 40670
31.6%
- 25764
20.0%
2 20524
15.9%
1 15496
 
12.0%
7 6470
 
5.0%
3 4626
 
3.6%
8 4293
 
3.3%
6 3421
 
2.7%
4 3091
 
2.4%
5 2968
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 103056
80.0%
Dash Punctuation 25764
 
20.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 40670
39.5%
2 20524
19.9%
1 15496
 
15.0%
7 6470
 
6.3%
3 4626
 
4.5%
8 4293
 
4.2%
6 3421
 
3.3%
4 3091
 
3.0%
5 2968
 
2.9%
9 1497
 
1.5%
Dash Punctuation
ValueCountFrequency (%)
- 25764
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 128820
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 40670
31.6%
- 25764
20.0%
2 20524
15.9%
1 15496
 
12.0%
7 6470
 
5.0%
3 4626
 
3.6%
8 4293
 
3.3%
6 3421
 
2.7%
4 3091
 
2.4%
5 2968
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 128820
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 40670
31.6%
- 25764
20.0%
2 20524
15.9%
1 15496
 
12.0%
7 6470
 
5.0%
3 4626
 
3.6%
8 4293
 
3.3%
6 3421
 
2.7%
4 3091
 
2.4%
5 2968
 
2.3%

identificationQualifier
Text

Constant  Missing 

Distinct1
Distinct (%)100.0%
Missing26788
Missing (%)> 99.9%
Memory size209.4 KiB

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters3
Distinct characters3
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st rowsp.
ValueCountFrequency (%)
sp 1
100.0%

Most occurring characters

ValueCountFrequency (%)
s 1
33.3%
p 1
33.3%
. 1
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2
66.7%
Other Punctuation 1
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 1
50.0%
p 1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2
66.7%
Common 1
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 1
50.0%
p 1
50.0%
Common
ValueCountFrequency (%)
. 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 1
33.3%
p 1
33.3%
. 1
33.3%
Distinct2244
Distinct (%)8.4%
Missing1
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length59
Median length38
Mean length20.60683888
Min length3

Characters and Unicode

Total characters552016
Distinct characters54
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique800 ?
Unique (%)3.0%

Sample

1st rowRhinella marina
2nd rowRhinella marina
3rd rowRhinella marina
4th rowRhinella marina
5th rowRhinella marina
ValueCountFrequency (%)
rana 2000
 
3.4%
eleutherodactylus 1588
 
2.7%
hyla 1333
 
2.3%
bufo 1282
 
2.2%
thamnophis 1250
 
2.1%
coqui 943
 
1.6%
sirtalis 900
 
1.5%
desmognathus 819
 
1.4%
viridescens 796
 
1.3%
ambystoma 792
 
1.3%
Other values (1986) 47426
80.2%

Most occurring characters

ValueCountFrequency (%)
a 53449
 
9.7%
s 48052
 
8.7%
i 44698
 
8.1%
e 37508
 
6.8%
r 35821
 
6.5%
o 35286
 
6.4%
u 34577
 
6.3%
t 33149
 
6.0%
32343
 
5.9%
l 28396
 
5.1%
Other values (44) 168737
30.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 491765
89.1%
Space Separator 32343
 
5.9%
Uppercase Letter 27148
 
4.9%
Close Punctuation 360
 
0.1%
Open Punctuation 360
 
0.1%
Other Punctuation 39
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 53449
10.9%
s 48052
9.8%
i 44698
 
9.1%
e 37508
 
7.6%
r 35821
 
7.3%
o 35286
 
7.2%
u 34577
 
7.0%
t 33149
 
6.7%
l 28396
 
5.8%
n 26527
 
5.4%
Other values (16) 114302
23.2%
Uppercase Letter
ValueCountFrequency (%)
E 2926
10.8%
R 2572
9.5%
A 2493
9.2%
H 2369
8.7%
C 2361
8.7%
P 2300
8.5%
S 2125
7.8%
T 1856
 
6.8%
B 1570
 
5.8%
L 1430
 
5.3%
Other values (13) 5146
19.0%
Space Separator
ValueCountFrequency (%)
32343
100.0%
Close Punctuation
ValueCountFrequency (%)
) 360
100.0%
Open Punctuation
ValueCountFrequency (%)
( 360
100.0%
Other Punctuation
ValueCountFrequency (%)
. 39
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 518913
94.0%
Common 33103
 
6.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 53449
 
10.3%
s 48052
 
9.3%
i 44698
 
8.6%
e 37508
 
7.2%
r 35821
 
6.9%
o 35286
 
6.8%
u 34577
 
6.7%
t 33149
 
6.4%
l 28396
 
5.5%
n 26527
 
5.1%
Other values (39) 141450
27.3%
Common
ValueCountFrequency (%)
32343
97.7%
) 360
 
1.1%
( 360
 
1.1%
. 39
 
0.1%
- 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 552016
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 53449
 
9.7%
s 48052
 
8.7%
i 44698
 
8.1%
e 37508
 
6.8%
r 35821
 
6.5%
o 35286
 
6.4%
u 34577
 
6.3%
t 33149
 
6.0%
32343
 
5.9%
l 28396
 
5.1%
Other values (44) 168737
30.6%
Distinct588
Distinct (%)2.2%
Missing0
Missing (%)0.0%
Memory size209.4 KiB

Length

Max length79
Median length74
Mean length64.60252342
Min length13

Characters and Unicode

Total characters1730637
Distinct characters51
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique156 ?
Unique (%)0.6%

Sample

1st rowAnimalia | Chordata | Amphibia | Anura | Bufonidae | Rhinella
2nd rowAnimalia | Chordata | Amphibia | Anura | Bufonidae | Rhinella
3rd rowAnimalia | Chordata | Amphibia | Anura | Bufonidae | Rhinella
4th rowAnimalia | Chordata | Amphibia | Anura | Bufonidae | Rhinella
5th rowAnimalia | Chordata | Amphibia | Anura | Bufonidae | Rhinella
ValueCountFrequency (%)
133945
45.5%
animalia 26788
 
9.1%
chordata 26788
 
9.1%
amphibia 13684
 
4.7%
reptilia 13032
 
4.4%
squamata 11783
 
4.0%
anura 9174
 
3.1%
colubridae 5162
 
1.8%
caudata 4483
 
1.5%
hylidae 2650
 
0.9%
Other values (656) 46643
 
15.9%

Most occurring characters

ValueCountFrequency (%)
267635
15.5%
a 250600
14.5%
i 151280
 
8.7%
| 133945
 
7.7%
t 81517
 
4.7%
d 71381
 
4.1%
l 66676
 
3.9%
e 65926
 
3.8%
o 64469
 
3.7%
m 62885
 
3.6%
Other values (41) 514323
29.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1168870
67.5%
Space Separator 267635
 
15.5%
Uppercase Letter 160187
 
9.3%
Math Symbol 133945
 
7.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 250600
21.4%
i 151280
12.9%
t 81517
 
7.0%
d 71381
 
6.1%
l 66676
 
5.7%
e 65926
 
5.6%
o 64469
 
5.5%
m 62885
 
5.4%
n 59614
 
5.1%
r 58404
 
5.0%
Other values (16) 236118
20.2%
Uppercase Letter
ValueCountFrequency (%)
A 53460
33.4%
C 39161
24.4%
R 17674
 
11.0%
S 16493
 
10.3%
H 5555
 
3.5%
P 5208
 
3.3%
E 3755
 
2.3%
T 3585
 
2.2%
B 3284
 
2.1%
L 2829
 
1.8%
Other values (13) 9183
 
5.7%
Space Separator
ValueCountFrequency (%)
267635
100.0%
Math Symbol
ValueCountFrequency (%)
| 133945
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1329057
76.8%
Common 401580
 
23.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 250600
18.9%
i 151280
 
11.4%
t 81517
 
6.1%
d 71381
 
5.4%
l 66676
 
5.0%
e 65926
 
5.0%
o 64469
 
4.9%
m 62885
 
4.7%
n 59614
 
4.5%
r 58404
 
4.4%
Other values (39) 396305
29.8%
Common
ValueCountFrequency (%)
267635
66.6%
| 133945
33.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1730637
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
267635
15.5%
a 250600
14.5%
i 151280
 
8.7%
| 133945
 
7.7%
t 81517
 
4.7%
d 71381
 
4.1%
l 66676
 
3.9%
e 65926
 
3.8%
o 64469
 
3.7%
m 62885
 
3.6%
Other values (41) 514323
29.7%

kingdom
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters214304
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimalia
2nd rowAnimalia
3rd rowAnimalia
4th rowAnimalia
5th rowAnimalia
ValueCountFrequency (%)
animalia 26788
100.0%

Most occurring characters

ValueCountFrequency (%)
i 53576
25.0%
a 53576
25.0%
A 26788
12.5%
n 26788
12.5%
m 26788
12.5%
l 26788
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 187516
87.5%
Uppercase Letter 26788
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 53576
28.6%
a 53576
28.6%
n 26788
14.3%
m 26788
14.3%
l 26788
14.3%
Uppercase Letter
ValueCountFrequency (%)
A 26788
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 214304
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 53576
25.0%
a 53576
25.0%
A 26788
12.5%
n 26788
12.5%
m 26788
12.5%
l 26788
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 214304
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 53576
25.0%
a 53576
25.0%
A 26788
12.5%
n 26788
12.5%
m 26788
12.5%
l 26788
12.5%

phylum
Text

Distinct2
Distinct (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size209.4 KiB

Length

Max length10
Median length8
Mean length8.00007466
Min length8

Characters and Unicode

Total characters214306
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowChordata
2nd rowChordata
3rd rowChordata
4th rowChordata
5th rowChordata
ValueCountFrequency (%)
chordata 26787
> 99.9%
arthropoda 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
a 53575
25.0%
o 26789
12.5%
r 26789
12.5%
h 26788
12.5%
d 26788
12.5%
t 26788
12.5%
C 26787
12.5%
A 1
 
< 0.1%
p 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 187518
87.5%
Uppercase Letter 26788
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 53575
28.6%
o 26789
14.3%
r 26789
14.3%
h 26788
14.3%
d 26788
14.3%
t 26788
14.3%
p 1
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
C 26787
> 99.9%
A 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 214306
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 53575
25.0%
o 26789
12.5%
r 26789
12.5%
h 26788
12.5%
d 26788
12.5%
t 26788
12.5%
C 26787
12.5%
A 1
 
< 0.1%
p 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 214306
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 53575
25.0%
o 26789
12.5%
r 26789
12.5%
h 26788
12.5%
d 26788
12.5%
t 26788
12.5%
C 26787
12.5%
A 1
 
< 0.1%
p 1
 
< 0.1%

class
Text

Distinct4
Distinct (%)< 0.1%
Missing73
Missing (%)0.3%
Memory size209.4 KiB

Length

Max length14
Median length8
Mean length8.000187154
Min length7

Characters and Unicode

Total characters213733
Distinct characters19
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowAmphibia
2nd rowAmphibia
3rd rowAmphibia
4th rowAmphibia
5th rowAmphibia
ValueCountFrequency (%)
amphibia 13683
51.2%
reptilia 13031
48.8%
actinopterygii 1
 
< 0.1%
insecta 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
i 53431
25.0%
p 26715
12.5%
a 26715
12.5%
A 13684
 
6.4%
m 13683
 
6.4%
h 13683
 
6.4%
b 13683
 
6.4%
t 13034
 
6.1%
e 13033
 
6.1%
R 13031
 
6.1%
Other values (9) 13041
 
6.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 187017
87.5%
Uppercase Letter 26716
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 53431
28.6%
p 26715
14.3%
a 26715
14.3%
m 13683
 
7.3%
h 13683
 
7.3%
b 13683
 
7.3%
t 13034
 
7.0%
e 13033
 
7.0%
l 13031
 
7.0%
c 2
 
< 0.1%
Other values (6) 7
 
< 0.1%
Uppercase Letter
ValueCountFrequency (%)
A 13684
51.2%
R 13031
48.8%
I 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 213733
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 53431
25.0%
p 26715
12.5%
a 26715
12.5%
A 13684
 
6.4%
m 13683
 
6.4%
h 13683
 
6.4%
b 13683
 
6.4%
t 13034
 
6.1%
e 13033
 
6.1%
R 13031
 
6.1%
Other values (9) 13041
 
6.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 213733
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 53431
25.0%
p 26715
12.5%
a 26715
12.5%
A 13684
 
6.4%
m 13683
 
6.4%
h 13683
 
6.4%
b 13683
 
6.4%
t 13034
 
6.1%
e 13033
 
6.1%
R 13031
 
6.1%
Other values (9) 13041
 
6.1%

order
Text

Distinct9
Distinct (%)< 0.1%
Missing85
Missing (%)0.3%
Memory size209.4 KiB

Length

Max length17
Median length15
Mean length6.897768125
Min length5

Characters and Unicode

Total characters184198
Distinct characters25
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st rowAnura
2nd rowAnura
3rd rowAnura
4th rowAnura
5th rowAnura
ValueCountFrequency (%)
squamata 11782
44.1%
anura 9174
34.4%
caudata 4488
 
16.8%
testudines 1158
 
4.3%
crocodylia 79
 
0.3%
gymnophiona 14
 
0.1%
rhynchocephalia 7
 
< 0.1%
batrachoidiformes 1
 
< 0.1%
coleoptera 1
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
a 58094
31.5%
u 26602
14.4%
t 17430
 
9.5%
m 11797
 
6.4%
S 11782
 
6.4%
q 11782
 
6.4%
n 10367
 
5.6%
r 9256
 
5.0%
A 9174
 
5.0%
d 5726
 
3.1%
Other values (15) 12188
 
6.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 157494
85.5%
Uppercase Letter 26704
 
14.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 58094
36.9%
u 26602
16.9%
t 17430
 
11.1%
m 11797
 
7.5%
q 11782
 
7.5%
n 10367
 
6.6%
r 9256
 
5.9%
d 5726
 
3.6%
e 2326
 
1.5%
s 2317
 
1.5%
Other values (8) 1797
 
1.1%
Uppercase Letter
ValueCountFrequency (%)
S 11782
44.1%
A 9174
34.4%
C 4568
 
17.1%
T 1158
 
4.3%
G 14
 
0.1%
R 7
 
< 0.1%
B 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 184198
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 58094
31.5%
u 26602
14.4%
t 17430
 
9.5%
m 11797
 
6.4%
S 11782
 
6.4%
q 11782
 
6.4%
n 10367
 
5.6%
r 9256
 
5.0%
A 9174
 
5.0%
d 5726
 
3.1%
Other values (15) 12188
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 184198
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 58094
31.5%
u 26602
14.4%
t 17430
 
9.5%
m 11797
 
6.4%
S 11782
 
6.4%
q 11782
 
6.4%
n 10367
 
5.6%
r 9256
 
5.0%
A 9174
 
5.0%
d 5726
 
3.1%
Other values (15) 12188
 
6.6%

family
Text

Distinct118
Distinct (%)0.4%
Missing214
Missing (%)0.8%
Memory size209.4 KiB

Length

Max length19
Median length16
Mean length10.89945437
Min length6

Characters and Unicode

Total characters289653
Distinct characters41
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique12 ?
Unique (%)< 0.1%

Sample

1st rowBufonidae
2nd rowBufonidae
3rd rowBufonidae
4th rowBufonidae
5th rowBufonidae
ValueCountFrequency (%)
colubridae 5163
19.4%
hylidae 2649
10.0%
plethodontidae 2632
9.9%
ranidae 2011
 
7.6%
phrynosomatidae 1723
 
6.5%
bufonidae 1624
 
6.1%
eleutherodactylidae 1588
 
6.0%
scincidae 1551
 
5.8%
ambystomatidae 792
 
3.0%
salamandridae 733
 
2.8%
Other values (108) 6109
23.0%

Most occurring characters

ValueCountFrequency (%)
a 37160
12.8%
e 35471
12.2%
d 32936
11.4%
i 31234
10.8%
o 21179
 
7.3%
l 16309
 
5.6%
t 13593
 
4.7%
n 12071
 
4.2%
r 12069
 
4.2%
y 9144
 
3.2%
Other values (31) 68487
23.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 263078
90.8%
Uppercase Letter 26575
 
9.2%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 37160
14.1%
e 35471
13.5%
d 32936
12.5%
i 31234
11.9%
o 21179
8.1%
l 16309
 
6.2%
t 13593
 
5.2%
n 12071
 
4.6%
r 12069
 
4.6%
y 9144
 
3.5%
Other values (12) 41912
15.9%
Uppercase Letter
ValueCountFrequency (%)
C 5612
21.1%
P 4630
17.4%
H 3224
12.1%
S 2652
10.0%
E 2381
9.0%
R 2040
 
7.7%
B 1729
 
6.5%
A 1419
 
5.3%
V 749
 
2.8%
T 592
 
2.2%
Other values (9) 1547
 
5.8%

Most occurring scripts

ValueCountFrequency (%)
Latin 289653
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 37160
12.8%
e 35471
12.2%
d 32936
11.4%
i 31234
10.8%
o 21179
 
7.3%
l 16309
 
5.6%
t 13593
 
4.7%
n 12071
 
4.2%
r 12069
 
4.2%
y 9144
 
3.2%
Other values (31) 68487
23.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 289653
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 37160
12.8%
e 35471
12.2%
d 32936
11.4%
i 31234
10.8%
o 21179
 
7.3%
l 16309
 
5.6%
t 13593
 
4.7%
n 12071
 
4.2%
r 12069
 
4.2%
y 9144
 
3.2%
Other values (31) 68487
23.6%

genus
Text

Distinct542
Distinct (%)2.0%
Missing259
Missing (%)1.0%
Memory size209.4 KiB

Length

Max length17
Median length14
Mean length8.689295138
Min length3

Characters and Unicode

Total characters230527
Distinct characters49
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique135 ?
Unique (%)0.5%

Sample

1st rowRhinella
2nd rowRhinella
3rd rowRhinella
4th rowRhinella
5th rowRhinella
ValueCountFrequency (%)
rana 2000
 
7.5%
eleutherodactylus 1588
 
6.0%
hyla 1333
 
5.0%
bufo 1274
 
4.8%
thamnophis 1250
 
4.7%
desmognathus 819
 
3.1%
ambystoma 792
 
3.0%
sceloporus 791
 
3.0%
pseudacris 747
 
2.8%
nerodia 646
 
2.4%
Other values (532) 15290
57.6%

Most occurring characters

ValueCountFrequency (%)
a 21442
 
9.3%
o 20202
 
8.8%
s 18940
 
8.2%
e 16486
 
7.2%
u 14141
 
6.1%
r 13728
 
6.0%
t 12821
 
5.6%
l 12360
 
5.4%
i 11892
 
5.2%
h 11332
 
4.9%
Other values (39) 77183
33.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 203997
88.5%
Uppercase Letter 26530
 
11.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 21442
10.5%
o 20202
 
9.9%
s 18940
 
9.3%
e 16486
 
8.1%
u 14141
 
6.9%
r 13728
 
6.7%
t 12821
 
6.3%
l 12360
 
6.1%
i 11892
 
5.8%
h 11332
 
5.6%
Other values (16) 50653
24.8%
Uppercase Letter
ValueCountFrequency (%)
E 2943
11.1%
R 2566
9.7%
A 2412
9.1%
H 2361
8.9%
C 2344
8.8%
P 2276
8.6%
S 2117
8.0%
T 1825
 
6.9%
B 1569
 
5.9%
N 1379
 
5.2%
Other values (13) 4738
17.9%

Most occurring scripts

ValueCountFrequency (%)
Latin 230527
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 21442
 
9.3%
o 20202
 
8.8%
s 18940
 
8.2%
e 16486
 
7.2%
u 14141
 
6.1%
r 13728
 
6.0%
t 12821
 
5.6%
l 12360
 
5.4%
i 11892
 
5.2%
h 11332
 
4.9%
Other values (39) 77183
33.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 230527
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 21442
 
9.3%
o 20202
 
8.8%
s 18940
 
8.2%
e 16486
 
7.2%
u 14141
 
6.1%
r 13728
 
6.0%
t 12821
 
5.6%
l 12360
 
5.4%
i 11892
 
5.2%
h 11332
 
4.9%
Other values (39) 77183
33.5%

specificEpithet
Text

Missing 

Distinct 1215
Distinct (%) 4.7%
Missing 945
Missing (%) 3.5%
Memory size 209.4 KiB

Length

Max length 21
Median length 16
Mean length 8.804016406
Min length 3

Characters and Unicode

Total characters 227531
Distinct characters 28
Distinct categories 3
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
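
As a rough illustration of how per-category character counts like the ones in this report can be derived, the sketch below uses Python's standard `unicodedata` module. It is not the profiler's own implementation. The function name `category_counts`, the `CATEGORY_NAMES` mapping, and the sample values are assumptions made for this example. Only general categories are covered, because `unicodedata` does not expose script or block information.

```python
import unicodedata
from collections import Counter

# Long names for the general-category codes that appear in this report.
# Assumption: this mapping is illustrative and not exhaustive.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:  # missing cell
            continue
        for ch in value:
            code = unicodedata.category(ch)  # two-letter code such as "Ll"
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Hypothetical sample values, not taken from the dataset.
sample = ["marina", "coqui", None, "sp. nov", "x-y"]
print(category_counts(sample))
```

Codes that are missing from the mapping fall back to their two-letter abbreviation, so the function still returns a count for every character.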

Unique

Unique 356
Unique (%) 1.4%
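
The Distinct and Unique figures measure different things. The sketch below assumes that "distinct" counts the different non-missing values, that "unique" counts the values occurring exactly once, and that both percentages are taken over non-missing observations. That reading is an assumption, but it is consistent with the numbers reported here: 1215 / (26789 - 945) is about 4.7% and 356 / (26789 - 945) is about 1.4%. The function name and sample values are hypothetical.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return distinct/unique counts and their shares of non-missing values."""
    present = [v for v in values if v is not None]
    freq = Counter(present)
    n = len(present)
    n_distinct = len(freq)  # number of different values
    n_unique = sum(1 for c in freq.values() if c == 1)  # values seen exactly once
    return {
        "distinct": n_distinct,
        "unique": n_unique,
        "p_distinct": n_distinct / n if n else 0.0,
        "p_unique": n_unique / n if n else 0.0,
    }


# Hypothetical sample values, not taken from the dataset.
print(distinct_and_unique(["marina", "marina", "coqui", None, "fuscus"]))

# Consistency check against the figures reported for specificEpithet.
non_missing = 26789 - 945
print(round(100 * 1215 / non_missing, 1), round(100 * 356 / non_missing, 1))
```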

Sample

1st row marina
2nd row marina
3rd row marina
4th row marina
5th row marina

Value                 Count    Frequency (%)
coqui                 943      3.6%
sirtalis              579      2.2%
ebraccata             531      2.1%
viridescens           515      2.0%
crucifer              514      2.0%
fuscus                457      1.8%
clamitans             402      1.6%
maculatum             402      1.6%
sipedon               401      1.6%
pipiens               386      1.5%
Other values (1205)   20716    80.2%

Most occurring characters

Value                 Count    Frequency (%)
a                     25667    11.3%
i                     25524    11.2%
s                     22534    9.9%
r                     17122    7.5%
u                     16985    7.5%
e                     16447    7.2%
t                     16153    7.1%
c                     14536    6.4%
n                     13145    5.8%
l                     12949    5.7%
Other values (18)     46469    20.4%

Most occurring categories

Value                 Count    Frequency (%)
Lowercase Letter      227527   > 99.9%
Space Separator       3        < 0.1%
Dash Punctuation      1        < 0.1%

Most frequent character per category

Lowercase Letter
Value                 Count    Frequency (%)
a                     25667    11.3%
i                     25524    11.2%
s                     22534    9.9%
r                     17122    7.5%
u                     16985    7.5%
e                     16447    7.2%
t                     16153    7.1%
c                     14536    6.4%
n                     13145    5.8%
l                     12949    5.7%
Other values (16)     46465    20.4%
Space Separator
Value                 Count    Frequency (%)
(space)               3        100.0%
Dash Punctuation
Value                 Count    Frequency (%)
-                     1        100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Latin                 227527   > 99.9%
Common                4        < 0.1%

Most frequent character per script

Latin
Value                 Count    Frequency (%)
a                     25667    11.3%
i                     25524    11.2%
s                     22534    9.9%
r                     17122    7.5%
u                     16985    7.5%
e                     16447    7.2%
t                     16153    7.1%
c                     14536    6.4%
n                     13145    5.8%
l                     12949    5.7%
Other values (16)     46465    20.4%
Common
Value                 Count    Frequency (%)
(space)               3        75.0%
-                     1        25.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 227531   100.0%

Most frequent character per block

ASCII
Value                 Count    Frequency (%)
a                     25667    11.3%
i                     25524    11.2%
s                     22534    9.9%
r                     17122    7.5%
u                     16985    7.5%
e                     16447    7.2%
t                     16153    7.1%
c                     14536    6.4%
n                     13145    5.8%
l                     12949    5.7%
Other values (18)     46469    20.4%

infraspecificEpithet
Text

Missing 

Distinct 479
Distinct (%) 7.9%
Missing 20731
Missing (%) 77.4%
Memory size 209.4 KiB

Length

Max length 30
Median length 18
Mean length 9.110267415
Min length 4

Characters and Unicode

Total characters 55190
Distinct characters 28
Distinct categories 3
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 144
Unique (%) 2.4%

Sample

1st row bislineata
2nd row fuscus
3rd row bivittatus
4th row sirtalis
5th row sirtalis

Value                 Count    Frequency (%)
sirtalis              321      5.2%
viridescens           281      4.6%
sipedon               193      3.1%
dorsalis              190      3.1%
bislineata            139      2.3%
triangulum            132      2.2%
americanus            122      2.0%
fuscus                113      1.8%
edwardsii             100      1.6%
constrictor           100      1.6%
Other values (466)    4445     72.4%

Most occurring characters

Value                 Count    Frequency (%)
i                     6889     12.5%
s                     6194     11.2%
a                     5636     10.2%
r                     4683     8.5%
e                     3924     7.1%
n                     3388     6.1%
u                     3339     6.1%
t                     3212     5.8%
l                     3036     5.5%
o                     2916     5.3%
Other values (18)     11973    21.7%

Most occurring categories

Value                 Count    Frequency (%)
Lowercase Letter      55073    99.8%
Space Separator       79       0.1%
Other Punctuation     38       0.1%

Most frequent character per category

Lowercase Letter
Value                 Count    Frequency (%)
i                     6889     12.5%
s                     6194     11.2%
a                     5636     10.2%
r                     4683     8.5%
e                     3924     7.1%
n                     3388     6.2%
u                     3339     6.1%
t                     3212     5.8%
l                     3036     5.5%
o                     2916     5.3%
Other values (16)     11856    21.5%
Space Separator
Value                 Count    Frequency (%)
(space)               79       100.0%
Other Punctuation
Value                 Count    Frequency (%)
.                     38       100.0%

Most occurring scripts

Value                 Count    Frequency (%)
Latin                 55073    99.8%
Common                117      0.2%

Most frequent character per script

Latin
Value                 Count    Frequency (%)
i                     6889     12.5%
s                     6194     11.2%
a                     5636     10.2%
r                     4683     8.5%
e                     3924     7.1%
n                     3388     6.2%
u                     3339     6.1%
t                     3212     5.8%
l                     3036     5.5%
o                     2916     5.3%
Other values (16)     11856    21.5%
Common
Value                 Count    Frequency (%)
(space)               79       67.5%
.                     38       32.5%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 55190    100.0%

Most frequent character per block

ASCII
Value                 Count    Frequency (%)
i                     6889     12.5%
s                     6194     11.2%
a                     5636     10.2%
r                     4683     8.5%
e                     3924     7.1%
n                     3388     6.1%
u                     3339     6.1%
t                     3212     5.8%
l                     3036     5.5%
o                     2916     5.3%
Other values (18)     11973    21.7%
Distinct 7
Distinct (%) < 0.1%
Missing 1
Missing (%) < 0.1%
Memory size 209.4 KiB

Length

Max length 10
Median length 7
Mean length 7.612326415
Min length 5

Characters and Unicode

Total characters 203919
Distinct characters 18
Distinct categories 1
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row species
2nd row species
3rd row species
4th row species
5th row species

Value                 Count    Frequency (%)
species               19786    73.9%
subspecies            6058     22.6%
genus                 686      2.6%
order                 129      0.5%
phylum                72       0.3%
family                45       0.2%
class                 12       < 0.1%

Most occurring characters

Value                 Count    Frequency (%)
s                     58456    28.7%
e                     52503    25.7%
p                     25916    12.7%
i                     25889    12.7%
c                     25856    12.7%
u                     6816     3.3%
b                     6058     3.0%
g                     686      0.3%
n                     686      0.3%
r                     258      0.1%
Other values (8)      795      0.4%

Most occurring categories

Value                 Count    Frequency (%)
Lowercase Letter      203919   100.0%

Most frequent character per category

Lowercase Letter
Value                 Count    Frequency (%)
s                     58456    28.7%
e                     52503    25.7%
p                     25916    12.7%
i                     25889    12.7%
c                     25856    12.7%
u                     6816     3.3%
b                     6058     3.0%
g                     686      0.3%
n                     686      0.3%
r                     258      0.1%
Other values (8)      795      0.4%

Most occurring scripts

Value                 Count    Frequency (%)
Latin                 203919   100.0%

Most frequent character per script

Latin
Value                 Count    Frequency (%)
s                     58456    28.7%
e                     52503    25.7%
p                     25916    12.7%
i                     25889    12.7%
c                     25856    12.7%
u                     6816     3.3%
b                     6058     3.0%
g                     686      0.3%
n                     686      0.3%
r                     258      0.1%
Other values (8)      795      0.4%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 203919   100.0%

Most frequent character per block

ASCII
Value                 Count    Frequency (%)
s                     58456    28.7%
e                     52503    25.7%
p                     25916    12.7%
i                     25889    12.7%
c                     25856    12.7%
u                     6816     3.3%
b                     6058     3.0%
g                     686      0.3%
n                     686      0.3%
r                     258      0.1%
Other values (8)      795      0.4%

nomenclaturalCode
Text

Constant 

Distinct 1
Distinct (%) < 0.1%
Missing 1
Missing (%) < 0.1%
Memory size 209.4 KiB

Length

Max length 4
Median length 4
Mean length 4
Min length 4

Characters and Unicode

Total characters 107152
Distinct characters 4
Distinct categories 1
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row ICZN
2nd row ICZN
3rd row ICZN
4th row ICZN
5th row ICZN

Value                 Count    Frequency (%)
iczn                  26788    100.0%

Most occurring characters

Value                 Count    Frequency (%)
I                     26788    25.0%
C                     26788    25.0%
Z                     26788    25.0%
N                     26788    25.0%

Most occurring categories

Value                 Count    Frequency (%)
Uppercase Letter      107152   100.0%

Most frequent character per category

Uppercase Letter
Value                 Count    Frequency (%)
I                     26788    25.0%
C                     26788    25.0%
Z                     26788    25.0%
N                     26788    25.0%

Most occurring scripts

Value                 Count    Frequency (%)
Latin                 107152   100.0%

Most frequent character per script

Latin
Value                 Count    Frequency (%)
I                     26788    25.0%
C                     26788    25.0%
Z                     26788    25.0%
N                     26788    25.0%

Most occurring blocks

Value                 Count    Frequency (%)
ASCII                 107152   100.0%

Most frequent character per block

ASCII
Value                 Count    Frequency (%)
I                     26788    25.0%
C                     26788    25.0%
Z                     26788    25.0%
N                     26788    25.0%